MINIMUM DISTANCE REGRESSION MODEL CHECKING
WITH BERKSON MEASUREMENT ERRORS 

By Hira L. Koul and Weixing Song

Michigan State University and Kansas State University 

Lack-of-fit testing of a regression model with Berkson measure- 
ment error has not been discussed in the literature to date. To fill 
this void, we propose a class of tests based on minimized integrated 
square distances between a nonparametric regression function esti- 
mator and the parametric model being fitted. We prove asymptotic 
normality of these test statistics under the null hypothesis and that of 
the corresponding minimum distance estimators under minimal con- 
ditions on the model being fitted. We also prove consistency of the 
proposed tests against a class of fixed alternatives and obtain their 
asymptotic power against a class of local alternatives orthogonal to 
the null hypothesis. These latter results are new even when there 
is no measurement error. A simulation that is included shows very 
desirable finite sample behavior of the proposed inference procedures. 

1. Introduction. A classical problem in statistics is to use a vector X
of $d$-dimensional variables, $d \ge 1$, to explain the one-dimensional response
Y. As is the practice, this is often done in terms of the regression function
$\mu(x) := E(Y|X = x)$, $x \in \mathbb{R}^d$, assuming it exists. Usually, in practice the
predictor vector X is assumed to be observable. But in many experiments, 
it is expensive or impossible to observe X. Instead, a proxy or a manifest 
Z of X can be measured. As an example, consider the herbicide study of 
Rudemo, Ruppert and Streibig [16] in which a nominal measured amount 
Z of herbicide was applied to a plant but the actual amount absorbed X 
by the plant is unobservable. As another example, from Wang [20], an epi- 
demiologist studies the severity of a lung disease, Y, among the residents in 
a city in relation to the amount of certain air pollutants, X. The amount 
of the air pollutants Z can be measured at certain observation stations in 
the city, but the actual exposure of the residents to the pollutants, X , is 
unobservable and may vary randomly from the Z-values. In both cases, X
can be expressed as Z plus a random error. There are many similar exam- 
ples in agricultural or medical studies; see, for example, Carroll, Ruppert 
and Stefanski [5] and Fuller [10], among others. All these examples can be 
formalized into the so-called Berkson model 

(1.1) $Y = \mu(X) + \varepsilon$, $X = Z + \eta$,

where $\eta$ and $\varepsilon$ are random errors with $E\varepsilon = 0$, $\eta$ is $d$-dimensional and Z is
the observable $d$-dimensional control variable. All three r.v.'s $\varepsilon$, $\eta$ and Z are
assumed to be mutually independent.

Let $\mathcal{M} := \{m_\theta(x) : x \in \mathbb{R}^d, \theta \in \Theta \subset \mathbb{R}^q\}$, $q \ge 1$, be a class of known functions.
The parametric Berkson regression model where $\mu \in \mathcal{M}$ has been the
focus of numerous authors. Cheng and Van Ness [6] and Fuller [10], among 
others, discuss the estimation in the linear Berkson model. For nonlinear 
models, [5] and references therein consider the estimation problem by using 
a regression calibration method. Huwang and Huang [13] study the estimation
problem when $m_\theta(x)$ is a polynomial in $x$ of a known order and show
that the least square estimators based on the first two conditional moments 
of Y, given Z, are consistent. Similar results are obtained in [19] and [20] 
for a class of nonlinear Berkson models. 

But the literature appears to be scant on the lack-of-fit testing problem in
this important model. This paper attempts to fill this void. To
be precise, with (Z, Y) obeying the model (1.1), the problem of interest here
is to test the hypothesis

$\mathcal{H}_0 : \mu(x) = m_{\theta_0}(x)$ for some $\theta_0 \in \Theta$ and for all $x \in \mathcal{I}$;
$\mathcal{H}_1 : \mathcal{H}_0$ is not true,

based on a random sample $(Z_i, Y_i)$, $1 \le i \le n$, from the distribution of (Z, Y),
where $\Theta$ and $\mathcal{I}$ are compact subsets of $\mathbb{R}^q$ and $\mathbb{R}^d$, respectively.

Interesting and profound results, on the contrary, are available for re- 
gression model checking in the absence of errors in independent variables; 
see, for example, [1, 11, 12] and references therein, [17, 18], among others. 
Koul and Ni [14] use the minimum distance methodology to propose tests of 
lack-of-fit of a parametric regression model in the classical regression setup. 
In a finite sample comparison of these tests with some other existing tests, 
they noted that a member of this class preserves the asymptotic level and 
has relatively very high power against some alternatives. The present paper 
extends this methodology to the above Berkson model. 

To be specific, Koul and Ni considered the following tests of $\mathcal{H}_0$ where
the design is random and observable, and the errors are heteroscedastic. 






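For any density kernel $K$, let $K_h(x) := K(x/h)/h^d$, $h > 0$, $x \in \mathbb{R}^d$. Define
$\hat f_w(x) := n^{-1}\sum_{j=1}^n K^*_w(x - X_j)$, $w = w_n \sim (\log n/n)^{1/(d+4)}$,

$$T_n(\theta) := \int_C \Big[\frac{1}{n\hat f_w(x)}\sum_{j=1}^n K_h(x - X_j)(Y_j - m_\theta(X_j))\Big]^2 dG(x)$$

and $\hat\theta_n := \arg\min\{T_n(\theta), \theta \in \Theta\}$, where $K, K^*$ are density kernel functions,
possibly different, $h = h_n$ and $w = w_n$ are the window widths, depending on
the sample size $n$, $C$ is a compact subset of $\mathbb{R}^d$ and $G$ is a $\sigma$-finite measure
on $C$. They proved consistency and asymptotic normality of this estimator,
and that the asymptotic null distribution of $D_n := nh_n^{d/2}(T_n(\hat\theta_n) - \tilde C_n)/\tilde\Gamma_n^{1/2}$
is standard normal, where

$$\tilde C_n := n^{-2}\sum_{i=1}^n \int K_h^2(x - X_i)\,\tilde\varepsilon_i^2\,\hat f_w^{-2}(x)\,dG(x), \qquad \tilde\varepsilon_i := Y_i - m_{\hat\theta_n}(X_i),$$

$$\tilde\Gamma_n := h^d n^{-2}\sum_{i \ne j}\Big(\int K_h(x - X_i)K_h(x - X_j)\,\tilde\varepsilon_i\tilde\varepsilon_j\,\hat f_w^{-2}(x)\,dG(x)\Big)^2.$$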

These results were made feasible by using an optimal window
width $w_n$ for the estimation of the denominator and a different window width
$h_n$ for the estimation of the numerator in the kernel-type nonparametric
estimator of the regression function. A consequence of the above asymptotic 
normality result is that at least for large samples one does not need to use 
any resampling method to implement these tests. 

These findings thus motivate one to look for tests of lack-of-fit in the 
Berkson model based on the above minimized distances. Since the predictors 
in Berkson models are unobservable, clearly the above procedures need some 
modifications. 

Let $f_\varepsilon, f_X, f_\eta, f_Z$ denote the density functions of the r.v.'s in their subscripts
and $\sigma_\varepsilon^2$ denote the variance of $\varepsilon$. In linear regression models, if one is
interested in making inference about the coefficient parameters only, these
density functions need not be known. Berkson [3] pointed out that the ordinary
least square estimators are unbiased and consistent in these models and
one can simply ignore the measurement error $\eta$. But if the regression model
is nonlinear or if there are other parameters in the Berkson model that need
to be estimated, then extra information about these densities should be supplied
to ensure identifiability. A standard assumption in the literature
is that $f_\eta$ is known or unknown up to a Euclidean parameter;
compare [5, 13, 20], among others. For the sake of relative transparency of
the exposition we assume that $f_\eta$ is known.

To adapt the Koul and Ni (K-N) procedure to the current setup, we
first need to obtain a nonparametric estimator of $\mu$. Note that in the model
(1.1), $f_X(x) = \int f_Z(z) f_\eta(x - z)\,dz$. For any kernel density $K$, let $K_{hi}(z) :=
K_h(z - Z_i)$, $\hat f_{Zh}(z) := \sum_{i=1}^n K_{hi}(z)/n$ and $\bar K_h(x, z) := \int K_h(z - y) f_\eta(x - y)\,dy$,
for $x, z \in \mathbb{R}^d$. It is then natural to estimate $f_X(x)$ and $\mu(x)$ by

$$\hat f_X(x) := \frac{1}{n}\sum_{i=1}^n \bar K_h(x, Z_i), \qquad \hat J_n(x) := \frac{\sum_{i=1}^n \bar K_h(x, Z_i)\,Y_i}{\sum_{i=1}^n \bar K_h(x, Z_i)}.$$

A routine argument, however, shows that $\hat J_n(x)$ is a consistent estimator of
$J(x) := E[H(Z)|X = x]$, where $H(z) := E[\mu(X)|Z = z]$, but not of $\mu(x)$.

We include the following simulation study to illustrate this point. Consider
the model $Y = X^2 + \varepsilon$, $X = Z + \eta$, where $\varepsilon$ and $\eta$ are Gaussian r.v.'s with
means zero and variances 0.01 and 0.05, respectively, and Z is a standard
Gaussian r.v. Then $J(x) = 0.0976 + 0.907x^2$. We generated 500 samples from
this model, calculated $\hat J_n$ and then put all three graphs, $\hat J_n(x)$, $\mu(x) = x^2$,
$J(x) = 0.0976 + 0.907x^2$, into one plot in Figure 1. The curves with solid,
dash-dot and dot lines are those of $\hat J_n$, $J(x)$ and $\mu(x) = x^2$, respectively.
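The constants in $J(x)$ can be checked directly. The following Python sketch is only an illustration under the Gaussian assumptions of this example; the function names are hypothetical. It uses the facts that $H(z) = z^2 + \mathrm{Var}(\eta)$ and that Z given X = x is Gaussian with mean $x/1.05$ and variance $0.05/1.05$, and then confirms the closed form by regressing simulated values of $H(Z)$ on $X^2$.

```python
import random

def j_coefficients(var_z=1.0, var_eta=0.05):
    """Return (intercept, slope) of J(x) = intercept + slope * x**2
    for mu(x) = x**2, Z ~ N(0, var_z), eta ~ N(0, var_eta)."""
    var_x = var_z + var_eta
    slope = (var_z / var_x) ** 2                 # (E[Z | X = x] / x) ** 2
    cond_var = var_z * var_eta / var_x           # Var(Z | X = x)
    intercept = var_eta + cond_var               # from H(z) = z**2 + var_eta
    return intercept, slope

def ols_of_h_on_x_squared(n=20000, var_eta=0.05, seed=1):
    """Least-squares fit of H(Z) on (1, X**2) from simulated data."""
    rng = random.Random(seed)
    t, hz = [], []
    for _ in range(n):
        z = rng.gauss(0.0, 1.0)
        x = z + rng.gauss(0.0, var_eta ** 0.5)
        t.append(x * x)
        hz.append(z * z + var_eta)
    t_bar, h_bar = sum(t) / n, sum(hz) / n
    sxy = sum((ti - t_bar) * (hi - h_bar) for ti, hi in zip(t, hz))
    sxx = sum((ti - t_bar) ** 2 for ti in t)
    slope = sxy / sxx
    return h_bar - slope * t_bar, slope

intercept, slope = j_coefficients()
mc_intercept, mc_slope = ols_of_h_on_x_squared()
print(round(intercept, 4), round(slope, 3))
print(mc_intercept, mc_slope)
```

The closed-form coefficients round to the values 0.0976 and 0.907 quoted above; the simulated fit should be close to them but not identical.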

To overcome this difficulty, one way to proceed is as follows. Define

$H_\theta(z) := E[m_\theta(X)|Z = z]$, $J_\theta(x) := E[H_\theta(Z)|X = x]$,

(1.2)
$$Q_n(\theta) := \int_C \Big[\frac{1}{n\hat f_X(x)}\sum_{i=1}^n \bar K_h(x, Z_i)\,Y_i - J_\theta(x)\Big]^2 dG(x),$$
$$\tilde Q_n(\theta) := \int_C \Big[\frac{1}{n\hat f_X(x)}\sum_{i=1}^n \bar K_h(x, Z_i)\,[Y_i - H_\theta(Z_i)]\Big]^2 dG(x)$$

and $\hat\theta_n = \arg\min_{\theta \in \Theta} Q_n(\theta)$, $\tilde\theta_n = \arg\min_{\theta \in \Theta} \tilde Q_n(\theta)$.
Under some conditions, we can show that $\hat\theta_n$, $\tilde\theta_n$ are consistent for $\theta_0$
and the asymptotic null distribution of a suitably standardized $Q_n(\hat\theta_n)$ is the
same as that of a degenerate $U$-statistic, whose asymptotic distribution in
turn is the same as that of an infinite sum of weighted centered chi-square
random variables. Since the kernel function in the degenerate $U$-statistic is
complicated, computation of its eigenvalues and eigenfunctions is not easy
and hence this test is hard to implement in practice.

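An alternative way to proceed is to use regression calibration as follows.
Because $E(Y|Z) = H(Z)$, one considers the new regression model
$Y = H(Z) + \zeta$, where the error $\zeta$ satisfies $E(\zeta|Z) = 0$. The problem of testing
for $\mathcal{H}_0$ is now transformed to that of testing for $H(z) = H_{\theta_0}(z)$. This
motivates the following modification of the K-N procedure that adjusts for
not observing the design variable. Let

$$\hat H_n(z) := \frac{1}{n\hat f_{Zw}(z)}\sum_{i=1}^n K_{hi}(z)\,Y_i, \qquad z \in \mathbb{R}^d,$$

where $\hat f_{Zw}$ is defined as $\hat f_{Zh}$ but with the kernel $K^*$ and the window width $w$.
Note that $\hat H_n$ is an estimator of $H(z) = E(\mu(X)|Z = z)$. Define

(1.3)
$$M_n^*(\theta) := \int_{\mathcal{I}} \Big[\frac{1}{n\hat f_{Zw}(z)}\sum_{i=1}^n K_{hi}(z)\,Y_i - H_\theta(z)\Big]^2 dG(z),$$
$$M_n(\theta) := \int_{\mathcal{I}} \Big[\frac{1}{n\hat f_{Zw}(z)}\sum_{i=1}^n K_{hi}(z)\,[Y_i - H_\theta(Z_i)]\Big]^2 dG(z),$$
$$\theta_n^* := \arg\min_{\theta \in \Theta} M_n^*(\theta), \qquad \hat\theta_n := \arg\min_{\theta \in \Theta} M_n(\theta),$$

where $G$ is a $\sigma$-finite measure supported on $\mathcal{I}$. We consider $M_n$ to be the
right analog of the above $T_n$ for the Berkson model. This paper establishes
consistency of $\theta_n^*$ and $\hat\theta_n$ for $\theta_0$ and asymptotic normality of $\sqrt{n}(\hat\theta_n - \theta_0)$,
under $\mathcal{H}_0$. Additionally, we prove that the asymptotic null distribution of the
normalized test statistic $\hat{\mathcal{D}}_n := nh^{d/2}\hat\Gamma_n^{-1/2}(M_n(\hat\theta_n) - \hat C_n)$ is standard normal,
which, unlike the modification (1.2), can be easily used to implement this
testing procedure, at least for large samples. Here,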



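(1.4)
$$d\hat\psi(z) := \frac{dG(z)}{\hat f_{Zw}^2(z)}, \qquad \hat\zeta_i := Y_i - H_{\hat\theta_n}(Z_i), \quad 1 \le i \le n,$$
$$\hat C_n := \frac{1}{n^2}\sum_{i=1}^n \int K_{hi}^2(z)\,\hat\zeta_i^2\,d\hat\psi(z),$$
$$\hat\Gamma_n := 2h^d n^{-2}\sum_{i \ne j}\Big(\int K_{hi}(z)K_{hj}(z)\,\hat\zeta_i\hat\zeta_j\,d\hat\psi(z)\Big)^2.$$

We note that a factor of 2 is missing in the analog of $\hat\Gamma_n$ in K-N.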
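As a rough illustration of how the minimized distance in (1.3) and the standardization in (1.4) might be computed, the following Python sketch treats the simplest case $d = 1$ with the linear family $H_\theta(z) = \theta z$ (which arises from $m_\theta(x) = \theta x$ when $E\eta = 0$). Everything beyond the formulas themselves is an assumption made for illustration: the Epanechnikov kernel is used for both $K$ and $K^*$, $G$ is taken to be Lebesgue measure on $[-1, 1]$ approximated by a Riemann sum, the bandwidth constants are set to one, and the function names are hypothetical. For this linear family the minimizer of $M_n$ is available in closed form, so no numerical optimizer is needed.

```python
import math
import random

def epan(u):
    """Epanechnikov kernel, a symmetric density supported on [-1, 1]."""
    return 0.75 * (1.0 - u * u) if abs(u) <= 1.0 else 0.0

def md_test_linear(z, y, h, w, grid):
    """MD estimate of theta and the standardized statistic for H_theta(z) = theta*z."""
    n = len(z)
    dz = grid[1] - grid[0]
    # kh[g][i] = K_h(grid[g] - Z_i); fw[g] = density estimate with window w
    kh = [[epan((t - zi) / h) / h for zi in z] for t in grid]
    fw = [sum(epan((t - zi) / w) for zi in z) / (n * w) for t in grid]
    psi = [dz / (f * f) for f in fw]               # d psi_hat = dG / f_hat**2
    a = [sum(k * yi for k, yi in zip(row, y)) / n for row in kh]
    b = [sum(k * zi for k, zi in zip(row, z)) / n for row in kh]
    # M_n(theta) = int (a - theta*b)**2 d psi_hat is quadratic in theta
    num = sum(ai * bi * p for ai, bi, p in zip(a, b, psi))
    den = sum(bi * bi * p for bi, p in zip(b, psi))
    theta = num / den
    m_n = sum((ai - theta * bi) ** 2 * p for ai, bi, p in zip(a, b, psi))
    res = [yi - theta * zi for yi, zi in zip(y, z)]   # residuals zeta_hat_i
    c_n = sum(row[i] ** 2 * res[i] ** 2 * p
              for row, p in zip(kh, psi) for i in range(n)) / n ** 2
    gamma = 0.0
    for i in range(n):
        for j in range(n):
            if i != j:
                integral = sum(row[i] * row[j] * p for row, p in zip(kh, psi))
                gamma += (integral * res[i] * res[j]) ** 2
    gamma *= 2.0 * h / n ** 2                      # 2 h^d n^{-2} with d = 1
    d_n = n * math.sqrt(h) * (m_n - c_n) / math.sqrt(gamma)
    return theta, d_n

# Simulated data from the null model Y = 2 X + eps, X = Z + eta.
random.seed(0)
n = 120
z = [random.gauss(0.0, 1.0) for _ in range(n)]
x = [zi + random.gauss(0.0, 0.05 ** 0.5) for zi in z]
y = [2.0 * xi + random.gauss(0.0, 0.1) for xi in x]
h = n ** (-0.3)                                    # h_n ~ n^{-a}, a = 0.3
w = (math.log(n) / n) ** 0.2                       # w_n ~ (log n / n)^{1/(d+4)}
grid = [-1.0 + 0.05 * k for k in range(41)]
theta_hat, d_n = md_test_linear(z, y, h, w, grid)
print(theta_hat, d_n)
```

The double loop for the variance estimate costs on the order of $n^2$ kernel-product integrals, which is acceptable for a sketch but would call for vectorization at realistic sample sizes.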

Even though K-N conducted some convincing simulations to demonstrate
the finite sample power properties of the $\hat{\mathcal{D}}_n$-tests, they did not discuss any
theoretical power properties of their tests. In contrast, we prove consistency
of the proposed minimum distance (MD) tests against a large class of fixed
alternatives and obtain their asymptotic power under a class of local alternatives.
Let $L_2(G)$ denote the class of real-valued square integrable functions
on $\mathbb{R}^d$ with respect to $G$, $\rho(\nu_1, \nu_2) := \int[\nu_1 - \nu_2]^2\,dG$, $\nu_1, \nu_2 \in L_2(G)$, and

(1.5) $T(\nu) := \arg\min_{\theta \in \Theta}\rho(\nu, H_\theta)$, $\nu \in L_2(G)$.

Let $m \in L_2(G)$ be a given function. Consider the problem of testing $\mathcal{H}_0$
against the alternative $\mathcal{H}_1 : \mu = m$, $m \notin \mathcal{M}$. Under assumption (m2) below
and $\mathcal{H}_0$, $T(H_{\theta_0}) = \theta_0$, while under $\mathcal{H}_1$, $T(H) \ne \theta_0$, where now $H(z) =
E(m(X)|Z = z)$. Consistency of the $\hat{\mathcal{D}}_n$-test requires consistency of $\hat\theta_n$ for
$T(H)$ only, while its asymptotic power properties against the local alternatives
$\mathcal{H}_{1n} : \mu = m_{\theta_0} + r/\sqrt{nh^{d/2}}$ require that $n^{1/2}(\hat\theta_n - \theta_0) = O_p(1)$, under $\mathcal{H}_{1n}$.
Here $r$ is a continuously differentiable function with $R(z) := E(r(X)|Z = z)$
such that $R \in L_2(G)$ and $\int H_\theta R\,dG = 0$ for all $\theta \in \Theta$. Under the assumptions of
Section 2 below, we show that under $\mathcal{H}_1$, $\hat\theta_n \to T(H)$, in probability, and under
$\mathcal{H}_{1n}$, both $n^{1/2}(\hat\theta_n - \theta_0)$ and $\hat{\mathcal{D}}_n$ are asymptotically normally distributed.

The paper is organized as follows. The needed assumptions are stated in 
the next section. All limits are taken as $n \to \infty$, unless mentioned otherwise.
Section 3 contains the proofs of consistency of $\theta_n^*$ and $\hat\theta_n$ while Section
4 discusses asymptotic normality of $\hat\theta_n$ and $\hat{\mathcal{D}}_n$, under $\mathcal{H}_0$. The power of
the MD-test for fixed and local alternatives is discussed in Section 5. The
simulation results in Section 6 show little bias in the estimator $\hat\theta_n$ for all
chosen sample sizes. The finite sample level approximates the nominal level
well for larger sample sizes and the empirical power is high (above 0.9) for 
moderate to large sample sizes against the chosen alternatives. 

Finally, we mention that closely related to the Berkson model is the so-called
errors-in-variables regression model in which $Z = X + u$. In this case
also one can use the above MD method to test $\mathcal{H}_0$, although we do not
deal with this here. The biggest challenge is to construct nonparametric
estimators of $f_X$ and $H_\theta$. The deconvolution estimators discussed in Fan
[7, 8], Fan and Truong [9], among others, may be found useful here.

2. Assumptions. Here we state the assumptions needed in this paper.
In the assumptions below $\theta_0$ denotes the true parameter value under $\mathcal{H}_0$.
About the errors, the underlying design and $G$ we assume the following:

(e1) $\{(Z_i, Y_i) : Z_i \in \mathbb{R}^d, i = 1, 2, \ldots, n\}$ are i.i.d. with $H(z) := E(Y|Z = z)$
satisfying $\int H^2\,dG < \infty$, where $G$ is a $\sigma$-finite measure on $\mathcal{I}$.

(e2) $0 < \sigma_\varepsilon^2 < \infty$, $Em_{\theta_0}^2(X) < \infty$ and the function $\tau^2(z) = E[(m_{\theta_0}(X) -
H_{\theta_0}(Z))^2|Z = z]$ is a.s. ($G$) continuous on $\mathcal{I}$.

(e3) $E|\varepsilon|^{2+\delta} < \infty$, $E|m_{\theta_0}(X) - H_{\theta_0}(Z)|^{2+\delta} < \infty$, for some $\delta > 0$.

(e4) $E|\varepsilon|^4 < \infty$, $E|m_{\theta_0}(X) - H_{\theta_0}(Z)|^4 < \infty$.

(f1) $f_Z$ is uniformly continuous and bounded from below on $\mathcal{I}$.

(f2) $f_Z$ is twice continuously differentiable.

(g) $G$ has a continuous Lebesgue density $g$ on $\mathcal{I}$.

About the bandwidth we shall make the following assumptions:

(h1) $h_n \to 0$.

(h2) $nh_n^{2d} \to \infty$.

(h3) $h_n \sim n^{-a}$, where $0 < a < \min(1/2d, 4/(d(d + 4)))$.

About the kernel functions K and K* we shall assume the following: 

(k) The kernel functions $K$, $K^*$ are positive symmetric square integrable
densities on $[-1, 1]^d$. In addition, $K^*$ satisfies a Lipschitz condition.

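About the parametric family $\{m_\theta\}$ we assume the following:

(m1) For each $\theta$, $m_\theta(x)$ is a.e. continuous in $x$ w.r.t. the Lebesgue measure.

(m2) The function $H_\theta(z)$ is identifiable w.r.t. $\theta$, that is, if $H_{\theta_1}(z) = H_{\theta_2}(z)$
for almost all $z$ ($G$), then $\theta_1 = \theta_2$.

(m3) For some positive continuous function $\ell$ on $\mathcal{I}$ and for some $0 < \beta \le 1$,
$|H_{\theta_2}(z) - H_{\theta_1}(z)| \le \|\theta_2 - \theta_1\|^\beta \ell(z)$, $\forall \theta_1, \theta_2 \in \Theta$, $z \in \mathcal{I}$.

For every $z$, $H_\theta(z)$ is differentiable in $\theta$ in a neighborhood of $\theta_0$ with the
vector of derivatives $\dot H_\theta(z)$ satisfying the following three conditions:

(m4) $\forall\, 0 < \delta_n \to 0$,
$$\sup_{1 \le i \le n,\ \|\theta - \theta_0\| \le \delta_n} \frac{|H_\theta(Z_i) - H_{\theta_0}(Z_i) - (\theta - \theta_0)'\dot H_{\theta_0}(Z_i)|}{\|\theta - \theta_0\|} = o_p(1).$$

(m5) $\forall\, 0 < k < \infty$,
$$\sup_{1 \le i \le n,\ \sqrt{nh^d}\|\theta - \theta_0\| \le k} h^{-d/2}\|\dot H_\theta(Z_i) - \dot H_{\theta_0}(Z_i)\| = o_p(1).$$

(m6) $\int \|\dot H_{\theta_0}\|^2\,dG < \infty$ and $\Sigma_0 := \int \dot H_{\theta_0}\dot H_{\theta_0}'\,dG$ is positive definite.

For later use we note that, under (h2) and (m4), $nh^d \to \infty$ and for every
$0 < k < \infty$,

(2.1)
$$\sup_{1 \le i \le n,\ \sqrt{nh^d}\|\theta - \theta_0\| \le k} \frac{|H_\theta(Z_i) - H_{\theta_0}(Z_i) - (\theta - \theta_0)'\dot H_{\theta_0}(Z_i)|}{\|\theta - \theta_0\|} = o_p(1).$$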
The above conditions are similar to those imposed in K-N on the model
$m_\theta$. Consider the following conditions in terms of the given model:

(m2') The parametric family of models $m_\theta(x)$ is identifiable w.r.t. $\theta$, that
is, if $m_{\theta_1}(x) = m_{\theta_2}(x)$ for almost all $x$, then $\theta_1 = \theta_2$.

(m3') For some positive continuous function $L$ on $\mathbb{R}^d$ with $EL(X) < \infty$
and for some $\beta > 0$, $|m_{\theta_2}(x) - m_{\theta_1}(x)| \le \|\theta_2 - \theta_1\|^\beta L(x)$, $\forall \theta_1, \theta_2 \in \Theta$, $x \in \mathbb{R}^d$.

The function $m_\theta(x)$ is differentiable in $\theta$ in a neighborhood of $\theta_0$, with
the vector of differential $\dot m_{\theta_0}$ satisfying the following two conditions:

(m4') $\forall\, 0 < \delta_n \to 0$,
$$\sup_{x \in \mathbb{R}^d,\ \|\theta - \theta_0\| \le \delta_n} \frac{|m_\theta(x) - m_{\theta_0}(x) - (\theta - \theta_0)'\dot m_{\theta_0}(x)|}{\|\theta - \theta_0\|} = o(1).$$

(m5') For every $0 < k < \infty$,
$$\sup_{x \in \mathbb{R}^d,\ \sqrt{nh^d}\|\theta - \theta_0\| \le k} h^{-d/2}\|\dot m_\theta(x) - \dot m_{\theta_0}(x)\| = o(1).$$

In some cases, (m2) and (m2') are equivalent. For example, if the family of
densities $\{f_\eta(\cdot - z); z \in \mathbb{R}^d\}$ is complete, then this holds. Similarly, if $m_\theta(x) =
\theta'\gamma(x)$ and $\int \gamma(x) f_\eta(x - z)\,dx \ne 0$, for all $z$, then also (m2) and (m2') are
equivalent.

We can also show that (m3')-(m5') imply (m3)-(m5), respectively. This
follows because $H_\theta(z) = \int m_\theta(x) f_\eta(x - z)\,dx$, so that under (m3'), $|H_{\theta_2}(z) -
H_{\theta_1}(z)| \le \|\theta_2 - \theta_1\|^\beta \int L(x) f_\eta(x - z)\,dx$, $\forall z \in \mathbb{R}^d$. Hence (m3) holds with
$\ell(z) = \int L(x) f_\eta(x - z)\,dx$. Note that $E\ell(Z) = EL(X) < \infty$.
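The calibrated function $H_\theta(z) = \int m_\theta(x) f_\eta(x - z)\,dx$ is the only place where the known error density enters, and it is straightforward to evaluate numerically. The following Python sketch is an illustration only: it assumes a one-dimensional Gaussian $\eta$ with mean zero, uses the trapezoid rule on a truncated range, and the function name and tuning constants are hypothetical. For $m_\theta(x) = \theta x^2$ the exact answer is $H_\theta(z) = \theta(z^2 + \mathrm{Var}(\eta))$, which serves as a check.

```python
import math

def calibrated_mean(m, z, sd_eta, n_nodes=2001, span=8.0):
    """Approximate H(z) = E[m(z + eta)], eta ~ N(0, sd_eta**2), by the trapezoid rule
    over eta in [-span*sd_eta, span*sd_eta]."""
    lo = -span * sd_eta
    step = 2.0 * span * sd_eta / (n_nodes - 1)
    norm = sd_eta * math.sqrt(2.0 * math.pi)
    total = 0.0
    for k in range(n_nodes):
        e = lo + k * step
        density = math.exp(-0.5 * (e / sd_eta) ** 2) / norm
        weight = 0.5 if k in (0, n_nodes - 1) else 1.0   # trapezoid end weights
        total += weight * m(z + e) * density
    return total * step

theta = 3.0
approx = calibrated_mean(lambda x: theta * x * x, 1.5, math.sqrt(0.05))
exact = theta * (1.5 ** 2 + 0.05)
print(approx, exact)
```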

Using the fact that $\int f_\eta(x - z)\,dx = 1$, the left-hand side of (m4) is bounded
above by $\sup_{x \in \mathbb{R}^d,\ \|\theta - \theta_0\| \le \delta_n} |m_\theta(x) - m_{\theta_0}(x) - (\theta - \theta_0)'\dot m_{\theta_0}(x)|/\|\theta - \theta_0\| = o(1)$,

by (m4'). Similarly, (m5') implies (m5) and (m1) implies that $H_\theta(z)$ is a.s.
continuous in $z$ ($G$).

The conditions (m1)-(m6) are trivially satisfied when $m_\theta(x) = \theta'\gamma(x)$ provided
components of $E[\gamma(X)|Z = z]$ are continuous, nonzero on $\mathcal{I}$ and the
matrix $\int E[\gamma(X)\gamma'(X)|Z = z]\,dG(z)$ is positive definite.

The conditions (e1), (e2), (f1), (k), (m1)-(m3), (h1) and (h2) suffice for
consistency of $\hat\theta_n$, while these plus (e3), (f2), (m4)-(m6) and (h3) are needed
for the asymptotic normality of $\hat\theta_n$. The asymptotic normality of $M_n(\hat\theta_n)$
needs (e1)-(e4) and (f1)-(m6) and (h3). Of course, (h3) implies (h1) and
(h2).
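The bandwidth restriction (h3) is easy to check numerically. The short Python sketch below is an illustration with hypothetical function names; it takes the proportionality constants in $h_n \sim n^{-a}$ and $w_n \sim (\log n/n)^{1/(d+4)}$ to be one, which is an assumption. Since $a < 1/(2d)$, the quantity $nh_n^{2d} = n^{1 - 2ad}$ grows with $n$, which is how (h3) gives (h2).

```python
import math

def bandwidth_exponent_bound(d):
    """Upper bound on a in (h3): 0 < a < min(1/(2d), 4/(d(d+4)))."""
    return min(1.0 / (2 * d), 4.0 / (d * (d + 4)))

def window_widths(n, d, a):
    """Return (h_n, w_n) with h_n = n**(-a) and w_n = (log n / n)**(1/(d+4))."""
    if not 0.0 < a < bandwidth_exponent_bound(d):
        raise ValueError("exponent a violates (h3)")
    return n ** (-a), (math.log(n) / n) ** (1.0 / (d + 4))

for d in (1, 2, 5):
    print(d, bandwidth_exponent_bound(d))

h, w = window_widths(500, 1, 0.3)
print(h, w, 500 * h ** 2)   # the last value is n * h**(2d) for d = 1
```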

Let $q_{h1} := f_Z/\hat f_{Zh} - 1$. From [15] we obtain that under (f1), (k), (h1) and
(h2),

(2.2)
$$\sup_{z \in \mathcal{I}}|\hat f_{Zh}(z) - f_Z(z)| = o_p(1), \qquad \sup_{z \in \mathcal{I}}|\hat f_{Zw}(z) - f_Z(z)| = o_p(1),$$
$$\sup_{z \in \mathcal{I}}|q_{h1}(z)| = o_p(1) = \sup_{z \in \mathcal{I}}|q_{w1}(z)|.$$

These conclusions are often used in the proofs below.

In the sequel, the true parameter $\theta_0$ is assumed to be an inner point of $\Theta$
and $\zeta := Y - H_{\theta_0}(Z)$. The integrals with respect to $G$ are understood to
be over $\mathcal{I}$. The convergence in distribution is denoted by $\to_d$ and $N_p(a, B)$
denotes the $p$-dimensional normal distribution with mean vector $a$ and
covariance matrix $B$, $p \ge 1$. We shall also need the following notation:

$d\psi(z) := dG(z)/f_Z^2(z)$, $\sigma_\zeta^2(z) := \mathrm{Var}_{\theta_0}(\zeta|Z = z) = \sigma_\varepsilon^2 + \tau^2(z)$,

$\zeta_i := Y_i - H_{\theta_0}(Z_i)$, $1 \le i \le n$,

n . 



(2.3) 



1=1 ■ 

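$K_2(v) := \int K(v + u)K(u)\,du$, $\|K_2\|^2 := \int K_2^2(v)\,dv$,

$\Gamma := 2\|K_2\|^2 \int (\sigma_\zeta^2(z))^2 g(z)\,d\psi(z)$,

$q_n(z) := (f_Z^2(z)/\hat f_{Zw}^2(z)) - 1$,

$\mu_n(z, \theta) := \frac{1}{n}\sum_{i=1}^n K_{hi}(z)H_\theta(Z_i)$, $\dot\mu_n(z, \theta) := \frac{1}{n}\sum_{i=1}^n K_{hi}(z)\dot H_\theta(Z_i)$,

(2.4) $U_n(z, \theta) := \frac{1}{n}\sum_{i=1}^n K_{hi}(z)[Y_i - H_\theta(Z_i)]$, $U_n(z) := U_n(z, \theta_0)$,

$Z_n(z, \theta) := \frac{1}{n}\sum_{i=1}^n K_{hi}(z)[H_\theta(Z_i) - H_{\theta_0}(Z_i)]$, $\theta \in \mathbb{R}^q$, $z \in \mathbb{R}^d$.

These entities are analogous to the similar entities defined at (3.1) in K-N.
The main difference is that $m_\theta$ there is replaced by $H_\theta$ and the $X_i$'s by $Z_i$'s.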
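The constant $\|K_2\|^2$ appearing in $\Gamma$ depends only on the kernel and can be computed once. As a hedged illustration in dimension one, the Python sketch below evaluates the convolution-type function $K_2$ and its squared norm by the midpoint rule; the function names and step size are assumptions of the sketch. For the uniform kernel on $[-1, 1]$ one has $K_2(v) = (2 - |v|)/4$ for $|v| \le 2$, so $\|K_2\|^2 = 1/3$, which provides a check.

```python
def uniform_kernel(u):
    """Uniform density on [-1, 1]."""
    return 0.5 if abs(u) <= 1.0 else 0.0

def k2(kernel, v, step=0.01):
    """K_2(v) = int K(v + u) K(u) du, midpoint rule over u in [-1, 1]."""
    m = int(round(2.0 / step))
    total = 0.0
    for k in range(m):
        u = -1.0 + (k + 0.5) * step
        total += kernel(v + u) * kernel(u)
    return total * step

def k2_squared_norm(kernel, step=0.01):
    """||K_2||^2 = int K_2(v)**2 dv, midpoint rule over v in [-2, 2]."""
    m = int(round(4.0 / step))
    return sum(k2(kernel, -2.0 + (k + 0.5) * step, step) ** 2 for k in range(m)) * step

norm_sq = k2_squared_norm(uniform_kernel)
print(k2(uniform_kernel, 0.0), norm_sq)
```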

3. Consistency of $\theta_n^*$ and $\hat\theta_n$. Recall (1.5). In this section we first prove
consistency of $\theta_n^*$ and $\hat\theta_n$ for $T(H)$, where $H$ corresponds to a given regression
function $m$. Consistency of these estimators for $\theta_0$ under $\mathcal{H}_0$ follows from
this general result. The following lemma is found useful in the proofs here.
Its proof is similar to that of Theorem 1 in [2].

Lemma 3.1. Under the condition (m3), the following hold:

(a) $T(\nu)$ always exists, for all $\nu \in L_2(G)$.

(b) If $T(\nu)$ is unique, then $T$ is continuous at $\nu$ in the sense that for any
sequence $\{\nu_n\} \in L_2(G)$ converging to $\nu$ in $L_2(G)$, $T(\nu_n) \to T(\nu)$, that is,
$\rho(\nu_n, \nu) \to 0$ implies $T(\nu_n) \to T(\nu)$.

(c) In addition, if (m2) holds, then $T(H_\theta) = \theta$, uniquely for all $\theta \in \Theta$.

From now on, we use the convention that for any integral $J := \int r\,d\hat\psi$, $\tilde J :=
\int r\,d\psi$. Also, let $\gamma^2(z) := E[(m(X) - H(Z))^2|Z = z]$, $z \in \mathbb{R}^d$. A consequence
of the above lemma is the following.

Lemma 3.2. Suppose (k), (f1), (m3) hold and $m$ is a given regression
function satisfying the model assumption (1.1), $H \in L_2(G)$ and $T(H)$ is
unique.

(a) In addition, suppose $H$ and $\gamma^2$ are a.e. ($G$) continuous. Then, $\theta_n^* =
T(H) + o_p(1)$.

(b) In addition, suppose $m$ is continuous on $\mathcal{I}$. Then, $\hat\theta_n = T(H) + o_p(1)$.
Proof. 

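Proof of part (a). We shall use part (b) of Lemma 3.1 with $\nu_n = \hat H_n$
and $\nu = H$. Note that $M_n^*(\theta) = \rho(\hat H_n, H_\theta)$, $\theta_n^* = T(\hat H_n)$. It thus suffices to
prove

(3.1) $\rho(\hat H_n, H) = o_p(1)$.

Let $\xi_i := Y_i - H(Z_i)$, $1 \le i \le n$,

(3.2)
$$U_n(z) := n^{-1}\sum_{i=1}^n K_{hi}(z)\xi_i, \qquad \bar H(z) := n^{-1}\sum_{i=1}^n K_{hi}(z)H(Z_i), \quad z \in \mathbb{R}^d,$$
$$\Delta_n := \int [\bar H - \hat f_{Zw}H]^2\,d\hat\psi.$$

To prove (3.1), plug $Y_i = \xi_i + H(Z_i)$ in $\rho(\hat H_n, H)$ and expand the quadratic
integrand to obtain that $\rho(\hat H_n, H) \le 2[\int U_n^2\,d\hat\psi + \Delta_n]$. By Fubini's theorem
and orthogonality of $Z_i$ and $\xi_i$,

(3.3) $E\int U_n^2(z)\,d\psi(z) = n^{-1}\int E\{K_h^2(z - Z)(\sigma_\varepsilon^2 + \gamma^2(Z))\}\,d\psi(z)$.

By the continuity of $f_Z$ [cf. (f1)], by a.e. continuity of $\gamma^2$ and by (k), we
obtain, for $j = 0, 2$, that

$$EK_h^2(z - Z)\gamma^j(Z) = \frac{1}{h^d}\int K^2(y) f_Z(z - yh)\gamma^j(z - yh)\,dy = O\Big(\frac{1}{h^d}\Big).$$

These calculations, the bound $\int U_n^2\,d\hat\psi \le \sup_{z \in \mathcal{I}}(f_Z^2(z)/\hat f_{Zw}^2(z)) \int U_n^2\,d\psi$ and (2.2)
imply that

(3.4) $E\int U_n^2\,d\psi = O\big(\frac{1}{nh^d}\big)$ and $\int U_n^2\,d\hat\psi = O_p\big(\frac{1}{nh^d}\big)$.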
Next, we shall show that

(3.5) $\Delta_n = o_p(1)$.

Toward this goal, add and subtract $H(z)E(\hat f_{Zw}(z)) = H(z)E(K_w^*(z - Z))$
and $E(\bar H(z)) = E(K_h(z - Z)H(Z))$ in the quadratic term of the integrand in
$\Delta_n$, to obtain $\Delta_n \le 4[\Delta_{n1} + \Delta_{n2} + \Delta_{n3}]$, where $\Delta_{n1} := \int[\bar H - E(\bar H)]^2\,d\hat\psi$,
$\Delta_{n2} := \int[\hat f_{Zw} - E(\hat f_{Zw})]^2 H^2\,d\hat\psi$, $\Delta_{n3} := \int[E(\bar H) - HE(\hat f_{Zw})]^2\,d\hat\psi$.

Fubini's theorem, (k), (f1) and $H$ being a.e. ($G$) continuous imply

$$E\tilde\Delta_{n1} \le n^{-1}\int E[K_h^2(z - Z)H^2(Z)]\,d\psi(z)
= (nh^d)^{-1}\int\!\!\int K^2(w)H^2(z - wh) f_Z(z - wh)\,dw\,d\psi(z)
= O((nh^d)^{-1}).$$

Because $\Delta_{n1} \le \sup_z (f_Z(z)/\hat f_{Zw}(z))^2\,\tilde\Delta_{n1}$, the above bound and (2.2) yield
that $\Delta_{n1} = O_p((nh^d)^{-1})$. Similarly, one shows that $\Delta_{n2} = O_p((nh^d)^{-1})$.
Next, $H$ being a.e. ($G$) continuous and (f1) yield

$$\tilde\Delta_{n3} = \int \Big[\int [K(u)H(z - hu) - H(z)K^*(u)] f_Z(z - hu)\,du\Big]^2 d\psi(z) \to 0.$$

Hence, by (2.2), $\Delta_{n3} = o_p(1)$. This completes the proof of (3.5) and hence
that of part (a).

Proof of part (b). Consistency of $\hat\theta_n$ for $\theta_0$ under $\mathcal{H}_0$ can be proved by
using the method in [14]. But that method does not yield consistency of $\hat\theta_n$
for $T(H)$ when $\mu = m$, $m \notin \mathcal{M}$. The proof in general consists of showing

(3.6) $\sup_{\theta \in \Theta}|M_n(\theta) - \rho(H, H_\theta)| = o_p(1)$.

This, (m3) and the continuity of $m$ on $\mathcal{I}$ imply that $H$ is continuous and
$|\rho(H, H_{\theta_2}) - \rho(H, H_{\theta_1})| \le C\|\theta_1 - \theta_2\|^\beta$, $\forall \theta_1, \theta_2 \in \Theta$, which in turn implies
that for all $\epsilon > 0$,

(3.7) $\lim_{\delta \to 0}\limsup_n P\big(\sup_{\|\theta_1 - \theta_2\| < \delta}|M_n(\theta_1) - M_n(\theta_2)| > \epsilon\big) = 0$.

These two facts in turn imply $\hat\theta_n = T(H) + o_p(1)$. For, suppose $\hat\theta_n \not\to T(H)$,
in probability. Then, by the compactness of $\Theta$, there is a subsequence $\{\hat\theta_{n_k}\}$
of $\{\hat\theta_n\}$ and a $\theta^* \ne T(H)$ such that $\hat\theta_{n_k} = \theta^* + o_p(1)$. Because $M_{n_k}(\hat\theta_{n_k}) \le
M_{n_k}(T(H))$, we obtain

$$\rho(H, H_{\theta^*}) \le \rho(H, H_{T(H)}) + 2\sup_\theta |M_{n_k}(\theta) - \rho(H, H_\theta)| + |M_{n_k}(\theta^*) - M_{n_k}(\hat\theta_{n_k})|.$$

By (3.6) and (3.7), the last two summands in the above bound are $o_p(1)$, so
that $\rho(H, H_{\theta^*}) \le \rho(H, H_{T(H)})$ eventually, with arbitrarily large probability.
In view of the uniqueness of $T(H)$, this is a contradiction unless $\theta^* = T(H)$.

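To prove (3.6), use the Cauchy-Schwarz (C-S) inequality to obtain that
$|M_n(\theta) - \rho(H, H_\theta)|^2$ is bounded above by the product $Q_{n1}(\theta)Q_{n2}(\theta)$, where

$$Q_{n1}(\theta) := \int \Big[[\hat H_n(z) - H(z)] - \Big[\frac{\mu_n(z, \theta)}{\hat f_{Zw}(z)} - H_\theta(z)\Big]\Big]^2 dG(z),$$
$$Q_{n2}(\theta) := \int \Big[[\hat H_n(z) + H(z)] - \Big[\frac{\mu_n(z, \theta)}{\hat f_{Zw}(z)} + H_\theta(z)\Big]\Big]^2 dG(z).$$

But $Q_{n1}(\theta)$ is bounded above by $2(\rho(\hat H_n, H) + \Delta_n(\theta))$, where $\Delta_n(\theta)$ is the
$\Delta_n$ of (3.2), with $H$ replaced by $H_\theta$. By (3.5), $\Delta_n(\theta) = o_p(1)$, for each $\theta \in \Theta$.

Similarly, $\forall \theta_1, \theta_2 \in \Theta$, $|\Delta_n(\theta_1) - \Delta_n(\theta_2)|^2$ is bounded above by the product

$$\int \Big[\Big[\frac{\mu_n(z, \theta_1)}{\hat f_{Zw}(z)} - \frac{\mu_n(z, \theta_2)}{\hat f_{Zw}(z)}\Big] - [H_{\theta_1}(z) - H_{\theta_2}(z)]\Big]^2 dG(z)
\times \int \Big[\Big[\frac{\mu_n(z, \theta_1)}{\hat f_{Zw}(z)} + \frac{\mu_n(z, \theta_2)}{\hat f_{Zw}(z)}\Big] - [H_{\theta_1}(z) + H_{\theta_2}(z)]\Big]^2 dG(z).$$

By (m3) and (2.2), the first term of this product is bounded above by
$\|\theta_1 - \theta_2\|^{2\beta}O_p(1)$, while the second term is $O_p(1)$ by the boundedness of $m_\theta(x)$
on $\mathcal{I} \times \Theta$. These facts, together with the compactness of $\Theta$, imply that
$\sup_{\theta \in \Theta} Q_{n1}(\theta) = o_p(1)$ while $m_\theta(x)$ bounded on $\mathcal{I} \times \Theta$ implies that
$\sup_{\theta \in \Theta} Q_{n2}(\theta) = O_p(1)$, thereby completing the proof of (3.6). □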



Upon taking $m = m_{\theta_0}$ in the above lemma one immediately obtains the
following.

Corollary 3.1. Suppose $\mathcal{H}_0$, (e1), (e2), (f1) and (m1)-(m3) hold. Then
$\theta_n^* \to \theta_0$, $\hat\theta_n \to \theta_0$, in probability.

4. Asymptotic distribution of $\hat\theta_n$ and $\hat{\mathcal{D}}_n$. In this section, we sketch a
proof of the asymptotic normality of $\sqrt{n}(\hat\theta_n - \theta_0)$ and $\hat{\mathcal{D}}_n$, under $\mathcal{H}_0$. This
proof is similar to that given in [14]. We indicate only the differences. To
begin with we focus on $\hat\theta_n$. The first step toward this goal is to show that

(4.1) $nh^d\|\hat\theta_n - \theta_0\|^2 = O_p(1)$.

Let $D_n(\theta) = \int Z_n^2(z, \theta)\,d\hat\psi(z)$. Arguing as in K-N, one obtains

(4.2) $nh^d D_n(\hat\theta_n) = O_p(1)$.
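Next, we shall show that for any $a > 0$, there exists an $N_a$ such that

(4.3) $P\big(D_n(\hat\theta_n)/\|\hat\theta_n - \theta_0\|^2 \ge a + \inf_{\|b\| = 1} b'\Sigma_0 b\big) > 1 - a$, $\forall n > N_a$,

where $\Sigma_0$ is as in (m6). The claim (4.1) then follows from (4.2), (4.3), (m6)
and the fact $nh^d D_n(\hat\theta_n) = nh^d\|\hat\theta_n - \theta_0\|^2[D_n(\hat\theta_n)/\|\hat\theta_n - \theta_0\|^2]$.
To prove (4.3), let $\Sigma_n(b) := \int [b'\dot\mu_n(z, \theta_0)]^2\,d\hat\psi(z)$, $b \in \mathbb{R}^q$, and

(4.4)
$$u_n := \hat\theta_n - \theta_0, \qquad d_{ni} := H_{\hat\theta_n}(Z_i) - H_{\theta_0}(Z_i) - u_n'\dot H_{\theta_0}(Z_i), \quad 1 \le i \le n,$$
$$D_{n1} := \int \Big[\frac{1}{n}\sum_{i=1}^n K_{hi}(z)\frac{d_{ni}}{\|u_n\|}\Big]^2 d\hat\psi(z), \qquad D_{n2} := \int \Big[\frac{u_n'\dot\mu_n(z, \theta_0)}{\|u_n\|}\Big]^2 d\hat\psi(z).$$

Note that

$$\frac{D_n(\hat\theta_n)}{\|\hat\theta_n - \theta_0\|^2} \ge D_{n1} + D_{n2} - 2D_{n1}^{1/2}D_{n2}^{1/2}.$$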



We remark here that this inequality corrects a typo in [14] in the equation
just above (4.8) on page 120. Assumption (m4) and consistency of $\hat\theta_n$ imply
that $D_{n1} = o_p(1)$. Exactly the same argument as in [14] with obvious modifications
proves that $\sup_{\|b\| = 1}\|\Sigma_n(b) - b'\Sigma_0 b\| = o_p(1)$ and (4.3), thereby
concluding the proof of (4.1). As in [14], this is used to prove the following
theorem where

$$\Sigma := \int \frac{(\sigma_\varepsilon^2 + \tau^2(u))\,\dot H_{\theta_0}(u)\dot H_{\theta_0}'(u)\,g^2(u)}{f_Z(u)}\,du.$$



Theorem 4.1. Assume (e1)-(e3), (f1), (f2), (g), (k), (m1)-(m5) and
(h3) hold. Then under $\mathcal{H}_0$, $n^{1/2}(\hat\theta_n - \theta_0) = \Sigma_0^{-1}n^{1/2}S_n + o_p(1)$. Consequently,
$n^{1/2}(\hat\theta_n - \theta_0) \to_d N_q(0, \Sigma_0^{-1}\Sigma\Sigma_0^{-1})$, where $\Sigma_0$ is defined in (m6).

This theorem shows that the asymptotic variance of $n^{1/2}(\hat\theta_n - \theta_0)$ consists
of two parts. The part involving $\sigma_\varepsilon^2$ reflects the variation in the regression
model, while the part involving $\tau^2$ reflects the variation in the measurement
error. This is the major difference between the asymptotic distribution of the
MD estimators discussed for the classical regression model in the K-N paper
and for the Berkson model here. Clearly, the larger the measurement error,
the larger $\tau^2$ will be.
Next, we state the asymptotic normality result about $\hat{\mathcal{D}}_n$. Its proof is
similar to that of Theorem 5.1 in [14] with obvious modifications and hence
no details are given. Recall the notation in (1.4).

Theorem 4.2. Suppose (e1), (e2), (e4), (f1), (f2), (g), (k), (m1)-(m5)
and (h3) hold. Then under $\mathcal{H}_0$, $nh^{d/2}(M_n(\hat\theta_n) - \hat C_n) \to_d N_1(0, \Gamma)$ and $|\hat\Gamma_n\Gamma^{-1} - 1| = o_p(1)$.

Consequently, the test that rejects $\mathcal{H}_0$ whenever $|\hat{\mathcal{D}}_n| > z_{\alpha/2}$ is of the
asymptotic size $\alpha$, where $z_\alpha$ is the $100(1 - \alpha)\%$ percentile of the standard
normal distribution.
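Once a value of the standardized statistic is available, the decision rule of Theorem 4.2 needs only a standard normal quantile. A minimal Python sketch follows; the function name is hypothetical and the statistic is assumed to have been computed elsewhere.

```python
from statistics import NormalDist

def md_test_rejects(d_n, alpha=0.05):
    """Reject the null hypothesis when |D_n| exceeds z_{alpha/2}."""
    critical = NormalDist().inv_cdf(1.0 - alpha / 2.0)   # upper alpha/2 quantile
    return abs(d_n) > critical

print(md_test_rejects(2.5), md_test_rejects(1.0))
```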



5. Power of the MD-test. We shall now discuss some theoretical results
about asymptotic power of the proposed tests. We shall show, under
some regularity conditions, that $|\hat{\mathcal{D}}_n| \to \infty$, in probability, under certain
fixed alternatives. This in turn implies consistency of the test that
rejects $\mathcal{H}_0$ whenever $|\hat{\mathcal{D}}_n|$ is large against these alternatives. We shall also
discuss asymptotic power of the proposed tests against certain local alternatives.
Accordingly, let $m \in L_2(G)$ and $H(z) := E(m(X)|Z = z)$. Also, let
$\nu(z, \theta) := H_\theta(z) - H(z)$, $e_{ni} := Y_i - H_{\theta_n}(Z_i)$, $e_i := Y_i - H(Z_i)$, where $\theta_n$ is
an estimator of $T(H)$ of (1.5). Let, for $z \in \mathbb{R}^d$, $\theta \in \Theta$,

$$V_n(z) := \frac{1}{n}\sum_{i=1}^n K_{hi}(z)e_i, \qquad \nu_n(z, \theta) := \frac{1}{n}\sum_{i=1}^n K_{hi}(z)\nu(Z_i, \theta).$$

5.1. Consistency. Let $\mathcal{D}_n := nh^{d/2}\Gamma_n^{-1/2}(M_n(\theta_n) - C_n)$, where

$$C_n := \frac{1}{n^2}\sum_{i=1}^n \int K_{hi}^2(z)e_{ni}^2\,d\hat\psi(z), \qquad \Gamma_n := 2h^d n^{-2}\sum_{i \ne j}\Big(\int K_{hi}(z)K_{hj}(z)e_{ni}e_{nj}\,d\hat\psi(z)\Big)^2.$$

If $\theta_n = \hat\theta_n$, then $C_n = \hat C_n$, $\Gamma_n = \hat\Gamma_n$ and $\mathcal{D}_n = \hat{\mathcal{D}}_n$. The following theorem
provides a set of sufficient conditions under which $|\mathcal{D}_n| \to \infty$, in probability,
for any sequence of consistent estimators $\theta_n$ of $T(H)$.

Theorem 5.1. Suppose (e1), (e2), (e4), (f1), (f2), (g), (k), (m3), (h3)
and the alternative hypothesis $\mathcal{H}_1 : \mu(x) = m(x)$, $\forall x \in \mathcal{I}$, hold with the additional
assumption that $\inf_\theta \rho(H, H_\theta) > 0$. Then, for any sequence of consistent
estimators $\theta_n$ of $T(H)$, $|\mathcal{D}_n| \to \infty$, in probability. Consequently, $|\hat{\mathcal{D}}_n| \to
\infty$, in probability.

Proof. Subtracting and adding $H(Z_i)$ from $e_{ni}$, we obtain $M_n(\theta_n) =
S_{n1} - 2S_{n2} + S_{n3}$, where $S_{n1} := \int V_n^2\,d\hat\psi$, $S_{n2} := \int V_n(z)\nu_n(z, \theta_n)\,d\hat\psi(z)$ and
$S_{n3} := \int \nu_n^2(z, \theta_n)\,d\hat\psi(z)$. Arguing as in Lemma 5.1 of [14], we can verify
that under the current setup, $nh^{d/2}(S_{n1} - C_n^*) \to_d N_1(0, \Gamma_1^*)$, where $C_n^* =
\sum_{i=1}^n \int K_{hi}^2(z)e_i^2\,d\hat\psi(z)/n^2$, $\Gamma_1^* = 2\int (\sigma_*^2(z))^2 g(z)\,d\psi(z)\,\|K_2\|^2$,
with $\sigma_*^2(z) = E[(Y - H(Z))^2|Z = z]$.

Next, consider $S_{n3}$. For convenience write $T$ for $T(H)$. By subtracting and
adding $H_T(Z_i)$ from $\nu(Z_i, \theta_n)$, we have $S_{n3} = S_{n31} + 2S_{n32} + S_{n33}$, where

$$S_{n31} := \int \nu_n^2(z, T)\,d\hat\psi(z),$$
$$S_{n32} := \int \nu_n(z, T)[\mu_n(z, \theta_n) - \mu_n(z, T)]\,d\hat\psi(z),$$
$$S_{n33} := \int [\mu_n(z, \theta_n) - \mu_n(z, T)]^2\,d\hat\psi(z).$$

Routine calculations and (2.2) show that $S_{n31} = \rho(H, H_T) + o_p(1)$, under $\mathcal{H}_1$.
By (m3), $S_{n33} \le \|\theta_n - T\|^{2\beta}\int \big[\frac{1}{n\hat f_{Zw}(z)}\sum_{i=1}^n K_{hi}(z)\ell(Z_i)\big]^2 dG(z) = o_p(1)$, by
consistency of $\theta_n$ for $T$. By the C-S inequality, one obtains that $S_{n32} =
o_p(1) = S_{n2}$. Therefore, $S_{n3} = \rho(H, H_T) + o_p(1)$.
Note that

$$C_n - C_n^* = -\frac{2}{n^2}\sum_{i=1}^n \int K_{hi}^2(z)e_i\nu(Z_i, \theta_n)\,d\hat\psi(z) + \frac{1}{n^2}\sum_{i=1}^n \int K_{hi}^2(z)\nu^2(Z_i, \theta_n)\,d\hat\psi(z).$$

Both terms on the right-hand side are of the order $o_p(1)$.

We shall next show that $\Gamma_n \to \Gamma^*$ in probability. Adding and subtracting
$H(Z_i)$ and $H(Z_j)$ from $e_{ni}$ and $e_{nj}$, respectively, and expanding the square
of integral, one can rewrite $\Gamma_n = \sum_{j=1}^{10} A_{nj}$, where, with all sums taken over $i \ne j$,
$\kappa_{ij}(z) := K_{hi}(z)K_{hj}(z)$ and $\nu_i := \nu(Z_i, \theta_n)$ written for brevity,

$$A_{n1} = 2h^d n^{-2}\sum \Big(\int \kappa_{ij}e_ie_j\,d\hat\psi\Big)^2, \qquad A_{n2} = 2h^d n^{-2}\sum \Big(\int \kappa_{ij}e_i\nu_j\,d\hat\psi\Big)^2,$$
$$A_{n3} = 2h^d n^{-2}\sum \Big(\int \kappa_{ij}\nu_ie_j\,d\hat\psi\Big)^2, \qquad A_{n4} = 2h^d n^{-2}\sum \Big(\int \kappa_{ij}\nu_i\nu_j\,d\hat\psi\Big)^2,$$
$$A_{n5} = -4h^d n^{-2}\sum \Big(\int \kappa_{ij}e_ie_j\,d\hat\psi\Big)\Big(\int \kappa_{ij}e_i\nu_j\,d\hat\psi\Big),$$
$$A_{n6} = -4h^d n^{-2}\sum \Big(\int \kappa_{ij}e_ie_j\,d\hat\psi\Big)\Big(\int \kappa_{ij}\nu_ie_j\,d\hat\psi\Big),$$
$$A_{n7} = 4h^d n^{-2}\sum \Big(\int \kappa_{ij}e_ie_j\,d\hat\psi\Big)\Big(\int \kappa_{ij}\nu_i\nu_j\,d\hat\psi\Big),$$
$$A_{n8} = 4h^d n^{-2}\sum \Big(\int \kappa_{ij}e_i\nu_j\,d\hat\psi\Big)\Big(\int \kappa_{ij}\nu_ie_j\,d\hat\psi\Big),$$
$$A_{n9} = -4h^d n^{-2}\sum \Big(\int \kappa_{ij}e_i\nu_j\,d\hat\psi\Big)\Big(\int \kappa_{ij}\nu_i\nu_j\,d\hat\psi\Big),$$
$$A_{n10} = -4h^d n^{-2}\sum \Big(\int \kappa_{ij}\nu_ie_j\,d\hat\psi\Big)\Big(\int \kappa_{ij}\nu_i\nu_j\,d\hat\psi\Big).$$

By taking the expectation, using Fubini's theorem we obtain

(5.1) $h^d n^{-2}\sum_{i \ne j}\big(\int K_{hi}(z)K_{hj}(z)|e_i||e_j|\,d\psi(z)\big)^2 = O_p(1)$,

(5.2) $h^d n^{-2}\sum_{i \ne j}\big(\int K_{hi}(z)K_{hj}(z)|e_i|^k\,d\psi(z)\big)^2 = O_p(1)$, $k = 0, 1$.

By (2.2) and (5.1) and arguing as in the proof of Lemma 5.5 in K-N, one
can verify that $A_{n1} \to_p \Gamma_1^* := 2\int (\sigma_*^2(z))^2 \frac{g^2(z)}{f_Z^2(z)}\,dz\,\|K_2\|^2$.

Add and subtract $H_T(Z_j)$ from $\nu(Z_j, \theta_n)$, to obtain

$$A_{n2} = \frac{2h^d}{n^2}\sum \Big(\int \kappa_{ij}e_i\nu(Z_j, T)\,d\hat\psi\Big)^2
+ \frac{4h^d}{n^2}\sum \Big(\int \kappa_{ij}e_i\nu(Z_j, T)\,d\hat\psi\Big)\Big(\int \kappa_{ij}e_i(H_{\theta_n}(Z_j) - H_T(Z_j))\,d\hat\psi\Big)
+ \frac{2h^d}{n^2}\sum \Big(\int \kappa_{ij}e_i(H_{\theta_n}(Z_j) - H_T(Z_j))\,d\hat\psi\Big)^2.$$

By (m4), consistency of $\theta_n$, (2.1), the C-S inequality on the double sum
and (5.2), the last two terms of the above expression are $o_p(1)$. Arguing, as
for $A_{n1}$, the first term on the right-hand side above converges in probability
to $\Gamma_2^* := 2\int \sigma_*^2(z)[H(z) - H_T(z)]^2 \frac{g^2(z)}{f_Z^2(z)}\,dz\,\|K_2\|^2$. Similarly, one can also show
$A_{n3} \to \Gamma_2^*$ in probability.

Similarly, by adding and subtracting $H_T(Z_i)$, $H_T(Z_j)$ from $\nu(Z_i, \theta_n)$,
$\nu(Z_j, \theta_n)$, respectively, in $A_{n4}$, one obtains $A_{n4} = \Gamma_3^* + o_p(1)$, where $\Gamma_3^* =
2\int [H(z) - H_T(z)]^4 \frac{g^2(z)}{f_Z^2(z)}\,dz\,\|K_2\|^2$. Next, rewrite

$$A_{n5} = -4h^d n^{-2}\sum \Big(\int \kappa_{ij}e_ie_j\,d\hat\psi\Big)\Big(\int \kappa_{ij}e_i\nu(Z_j, T)\,d\hat\psi\Big)
- 4h^d n^{-2}\sum \Big(\int \kappa_{ij}e_ie_j\,d\hat\psi\Big)\Big(\int \kappa_{ij}e_i[H_{\theta_n}(Z_j) - H_T(Z_j)]\,d\hat\psi\Big)
= A_{n51} + A_{n52}, \text{ say}.$$

Clearly, $E\tilde A_{n51} = 0$. Argue as for (5.13) in K-N, verify that $E(\tilde A_{n51}^2) =
O((nh^d)^{-1})$. Therefore, $\tilde A_{n51} = o_p(1)$. By the C-S inequality on the double
sum, (2.2), (5.1) and (5.2), we have $A_{n51} = \tilde A_{n51} + o_p(1)$. Hence $A_{n51} = o_p(1)$.
Similarly, one can verify $A_{n52} = o_p(1)$. These results imply $A_{n5} = o_p(1)$.

Similarly, one can show that $A_{ni} = o_p(1)$, $i = 6, 7, 8, 9, 10$. Note that $\Gamma^* =
\Gamma_1^* + 2\Gamma_2^* + \Gamma_3^*$, so we obtain that $\Gamma_n \to \Gamma^*$, in probability.

All these results together imply that

$$\mathcal{D}_n = nh^{d/2}\Gamma_n^{-1/2}(S_{n1} - C_n^*) + nh^{d/2}\Gamma_n^{-1/2}\rho(H, H_T) + o_p(nh^{d/2}),$$

hence the theorem. □



5.2. Power at local alternatives. Here we shall now study the asymptotic 
power of the proposed MD-test against some local alternatives. Accordingly, 






let r be a known continuously differentiable real-valued function and let 
R{z) := E{r{X)\Z = z). In addition, assume R G L2{G) and 



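(5.3)  $\int H_\theta R \, dG = 0, \qquad \forall \theta \in \Theta.$

Consider the sequence of local alternatives

(5.4)  $\mathcal{H}_{1n}: \mu(x) = m_{\theta_0}(x) + \gamma_n r(x), \qquad \gamma_n = 1/\sqrt{n h^{d/2}}.$

The following theorem gives the asymptotic distribution of $\hat\theta_n$ under $\mathcal{H}_{1n}$.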



Theorem 5.2. Suppose (e1)-(e3), (f1), (f2), (g), (k), (m1)-(m5) and
(h3) hold; then under the local alternative (5.3) and (5.4),
$n^{1/2}(\hat\theta_n - \theta_0) \to_d N_q(0, \Sigma_0^{-1} \Sigma \Sigma_0^{-1})$.

Proof. The basic idea of the proof is the same as in the null case. We
only stress the differences here. Under $\mathcal{H}_{1n}$, $\varepsilon_i = Y_i - m_{\theta_0}(X_i) - \gamma_n r(X_i)$.
Let $r_n(z) := \sum_{i=1}^n K_{hi}(z) r(X_i)/n$.

We first note that $n h^d M_n(\theta_0) = O_p(1)$. In fact, under (5.4), $M_n(\theta_0)$ can
be bounded above by 2 times the sum of $\gamma_n^2 \int r_n^2 \, d\hat\psi$ and

$\int \Big[ \frac{1}{n} \sum_{i=1}^n K_{hi}(z) (m_{\theta_0}(X_i) + \varepsilon_i - H_{\theta_0}(Z_i)) \Big]^2 d\hat\psi(z).$

Using the variance argument and (2.2), one verifies that this term is of the
order $O_p(n^{-1} h^{-d})$. Note that $r_n$ is a kernel estimator of $R$. Hence, $R \in L_2(G)$
and a routine argument shows that the former term is $O_p(n^{-1} h^{-d/2})$. This
leads to the conclusion $n h^d M_n(\theta_0) = O_p(1)$. This fact and an argument
similar to the one used in K-N, together with the fact $\hat\theta_n \to_p \theta_0$, yield
$n h^d \|\hat\theta_n - \theta_0\|^2 = O_p(1)$, under $\mathcal{H}_{1n}$.

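Note that with $\dot M_n(\theta) := \partial M_n(\theta)/\partial\theta$, $\hat\theta_n$ satisfies

(5.5)  $\dot M_n(\hat\theta_n) = -2 \int U_n(z, \hat\theta_n) \dot\mu_n(z, \hat\theta_n) \, d\hat\psi(z) = 0,$

where $U_n(z, \theta)$ and $\dot\mu_n(z, \theta)$ are defined in (2.4). Adding and subtracting
$H_{\theta_0}(Z_i)$ from $Y_i - H_{\hat\theta_n}(Z_i)$ in $U_n(z, \hat\theta_n)$, we can rewrite (5.5) as

(5.6)  $\int U_n(z) \dot\mu_n(z, \hat\theta_n) \, d\hat\psi(z) = \int Z_n(z, \hat\theta_n) \dot\mu_n(z, \hat\theta_n) \, d\hat\psi(z).$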

The right-hand side of (5.6) involves the error variables only through $\hat\theta_n$.
Since under $\mathcal{H}_{1n}$ we also have $n^{1/2}(\hat\theta_n - \theta_0) = O_p(1)$, its asymptotic behavior
under $\mathcal{H}_{1n}$ is the same as in the null case, that is, it equals $\mathcal{R}_n(\hat\theta_n - \theta_0) + o_p(1)$,
$\mathcal{R}_n = \Sigma_0 + o_p(1)$. The left-hand side, under (5.4), can be rewritten as
$S_{n1} + S_{n2}$, where

$S_{n1} = \int \frac{1}{n} \sum_{i=1}^n K_{hi}(z) [m_{\theta_0}(X_i) + \varepsilon_i - H_{\theta_0}(Z_i)] \dot\mu_n(z, \hat\theta_n) \, d\hat\psi(z),$

$S_{n2} = \gamma_n \int r_n(z) \dot\mu_n(z, \hat\theta_n) \, d\hat\psi(z).$

Note that $m_{\theta_0}(X_i) + \varepsilon_i - H_{\theta_0}(Z_i)$ are i.i.d. with mean 0 and finite second
moment. Arguing as in the proofs of Lemmas 4.1 and 4.2 of [14] with $\varepsilon_i$
there replaced by $m_{\theta_0}(X_i) + \varepsilon_i - H_{\theta_0}(Z_i)$ yields that under $\mathcal{H}_{1n}$,
$\sqrt{n} S_{n1} \to_d N_q(0, \Sigma)$. Thus, the theorem will be proved if we can show $\sqrt{n} S_{n2} = o_p(1)$.
For this purpose, with $r_h(z) := E K_h(z - Z) r(X) = E K_h(z - Z) R(Z)$, we
need the following facts. Arguing as for (3.4) and using differentiability of
$r$, one obtains



(5.7) 



a - rh? # = Op{n-^h-''), j [rh - Rfz? d^ = Oik""), 

I \\f,!,-HgJzfdi; = 0{h^''). 

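Then the integral in $S_{n2}$ can be written as

$\int \{ [r_n(z) - r_h(z)] + [r_h(z) - R(z) f_Z(z)] + R(z) f_Z(z) \}$

$\quad \times \{ [\dot\mu_n(z, \hat\theta_n) - \dot\mu_n(z, \theta_0)] + [\dot\mu_n(z, \theta_0) - \dot\mu_h(z)] + [\dot\mu_h(z) - \dot H_{\theta_0}(z) f_Z(z)] + \dot H_{\theta_0}(z) f_Z(z) \} \, d\hat\psi(z).$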

This can be further expanded into twelve terms. By (m5), (2.2), (5.7) and
C-S, one can show that all of these twelve terms are $o_p(h^{d/4})$ except the
term $\int R \dot H_{\theta_0} f_Z^2 \, d\hat\psi = \int R \dot H_{\theta_0} \, dG + \int R \dot H_{\theta_0} q_n \, dG$. But (5.3), (m6), continuity
of $r(x)$ and the compactness of $I$ imply $\int \dot H_{\theta_0} R \, dG = 0$. The second term is
bounded above by

(5.8)  $\sup_{z \in I} |q_n(z)| \int |R| \, \|\dot H_{\theta_0}\| \, dG.$

By Theorem 2.2, part (2) in Bosq [4] and the choice of $w = (\log n / n)^{1/(d+4)}$,
$(\log_k n)^{-1} (n/\log n)^{2/(d+4)} \sup_{z \in I} |\hat f_{Zw}(z) - f_Z(z)| \to 0$, almost surely, for all
$k > 0$. This fact and (h3) readily imply that (5.8) is of the order $o_p(h^{d/4})$, so
that $\sqrt{n} S_{n2} = \sqrt{n} \cdot (\sqrt{n h^{d/2}})^{-1} \cdot o_p(h^{d/4}) = o_p(1)$. Hence the theorem. □

The following theorem gives the asymptotic power of the MD-test against the
local alternative (5.3) and (5.4).

Theorem 5.3. Suppose (e1), (e2), (e4), (f1), (f2), (g), (k), (m4), (h3)
and the local alternative hypothesis (5.3) and (5.4) hold. Then,
$\mathcal{D}_n \to_d N(\Gamma^{-1/2} \int R^2 \, dG, 1)$, where $\Gamma$ is as in (2.3).

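Proof. Rewrite $M_n(\hat\theta_n) = T_{n1} + 2T_{n2} + T_{n3}$, where $T_{n1} := \int U_n^2 \, d\hat\psi$,
$T_{n2} := \int U_n(z) [\mu_n(z, \theta_0) - \mu_n(z, \hat\theta_n)] \, d\hat\psi$ and $T_{n3} := \int [\mu_n(z, \theta_0) - \mu_n(z, \hat\theta_n)]^2 \, d\hat\psi$. By
Theorem 5.2, $\sqrt{n}(\hat\theta_n - \theta_0) = O_p(1)$. This fact, (m4) and (2.2) imply
$T_{n3} = O_p(n^{-1})$.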

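Next, we shall show that $T_{n2} = O_p(n^{-1} h^{-d/4})$. By C-S, $T_{n2}^2 \le T_{n1} T_{n3}$.
Moreover, $T_{n1} = \int U_n^2 \, d\psi + \int U_n^2 q_n \, d\psi$. But under $\mathcal{H}_{1n}$, $Y_i = m_{\theta_0}(X_i) + \gamma_n r(X_i) + \varepsilon_i$.
Hence, $\int U_n^2 \, d\psi$ is bounded above by 3 times the sum

$\int \Big[ \frac{1}{n f_Z(z)} \sum_{i=1}^n K_{hi}(z) \varepsilon_i \Big]^2 dG(z) + \gamma_n^2 \int r_n^2 \, d\psi + \int \Big[ \frac{1}{n f_Z(z)} \sum_{i=1}^n K_{hi}(z) [m_{\theta_0}(X_i) - H_{\theta_0}(Z_i)] \Big]^2 dG(z).$

Arguing as in Section 2, all of these terms are $O_p(n^{-1} h^{-d/2})$. This fact
and (2.2) imply that the second term in $T_{n1}$ is of the order $o_p(n^{-1} h^{-d/2})$.
Hence $T_{n1} = O_p(n^{-1} h^{-d/2})$ and $T_{n2} = O_p(n^{-1} h^{-d/4})$.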

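We shall now obtain a more precise approximation to $T_{n1}$. For this purpose,
write $\xi_i = \varepsilon_i + m_{\theta_0}(X_i) - H_{\theta_0}(Z_i)$ and let $V_n(z) := \sum_{i=1}^n K_{hi}(z) \xi_i / n$.
Then, $T_{n1} = T_{n11} + 2\gamma_n T_{n12} + \gamma_n^2 T_{n13}$, where

$T_{n11} := \int V_n^2 \, d\hat\psi, \qquad T_{n12} := \int V_n r_n \, d\hat\psi, \qquad T_{n13} := \int r_n^2 \, d\hat\psi.$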



Now, we shall show that

(5.9)  $\int (R / \hat f_{Zw}) V_n \, dG = o_p(1/\sqrt{n h^{d/2}}).$

In fact, with $d\psi_1 := dG/f_Z$, the left-hand side equals $\int (R/f_Z) V_n \, dG + \int R V_n q_{n1} \, d\psi_1$.
The first term is an average of i.i.d. mean-zero r.v.'s and
a variance calculation shows that it is of the order $O_p(n^{-1/2})$, while by
Theorem 2.2, part (2) in Bosq [4], the second term is of the order $o_p(1/\sqrt{n h^{d/2}})$,
thereby proving (5.9).

Arguing as for (3.4), one obtains that $\int V_n^2 \, d\psi = O_p(1/(n h^d))$. Next, note
that $r_n / \hat f_{Zw}$ is an estimator of $R$, so by the C-S inequality again,

$\int (V_n / \hat f_{Zw}) [(r_n / \hat f_{Zw}) - R] \, dG = o_p(1/\sqrt{n h^{d/2}}).$



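This fact and (5.9) imply that $T_{n12} = o_p(1/\sqrt{n h^{d/2}})$. A similar and relatively
easier argument yields that $T_{n13} = \int R^2 \, dG + o_p(1)$.

Finally, we need to discuss the asymptotic behavior of $\hat C_n$ under the local
alternative (5.4). With $\zeta_i = Y_i - H_{\theta_0}(Z_i)$, rewrite $Y_i - H_{\hat\theta_n}(Z_i) = \zeta_i + H_{\theta_0}(Z_i) - H_{\hat\theta_n}(Z_i)$
in $\hat C_n$, to obtain

$\hat C_n = \frac{1}{n^2} \sum_{i=1}^n \int K_{hi}^2(z) \zeta_i^2 \, d\hat\psi(z) + \frac{2}{n^2} \sum_{i=1}^n \int K_{hi}^2(z) \zeta_i (H_{\theta_0}(Z_i) - H_{\hat\theta_n}(Z_i)) \, d\hat\psi(z)$

$\qquad + \frac{1}{n^2} \sum_{i=1}^n \int K_{hi}^2(z) (H_{\theta_0}(Z_i) - H_{\hat\theta_n}(Z_i))^2 \, d\hat\psi(z) =: C_{n1} + C_{n2} + C_{n3}.$

But with notation at (4.4),

$C_{n2} = \frac{2}{n^2} \sum_{i=1}^n \int K_{hi}^2(z) \xi_i d_{ni} \, d\hat\psi(z) + \frac{2\gamma_n}{n^2} \sum_{i=1}^n \int K_{hi}^2(z) r(X_i) d_{ni} \, d\hat\psi(z)$

$\qquad + \frac{2}{n^2} \sum_{i=1}^n \int K_{hi}^2(z) \xi_i u_n' \dot H_{\theta_0}(Z_i) \, d\hat\psi(z) + \frac{2\gamma_n}{n^2} \sum_{i=1}^n \int K_{hi}^2(z) r(X_i) u_n' \dot H_{\theta_0}(Z_i) \, d\hat\psi(z).$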



Recall that $\gamma_n = 1/\sqrt{n h^{d/2}}$. Using assumptions (m4), (h2), one can show that
the first and the third terms in $C_{n2}$ are of the order $O_p(n^{-3/2} h^{-d})$, and the
second and the fourth terms are of the order $O_p(n^{-2} h^{-5d/4})$. This implies
$C_{n2} = o_p(\gamma_n^2)$. Similarly, one can show that $C_{n3} = O_p(n^{-2} h^{-d}) = o_p(\gamma_n^2)$.

Since $Y_i - H_{\theta_0}(Z_i) = \xi_i + \gamma_n r(X_i)$, if we let $D_n = n^{-2} \sum_{i=1}^n \int K_{hi}^2(z) \xi_i^2 \, d\hat\psi(z)$,
then using a similar argument, we can show that $C_{n1} = D_n + o_p(\gamma_n^2)$.

To see the asymptotic property of $\hat\Gamma_n$ under the local alternative, adding
and subtracting $H_{\theta_0}(Z_i)$, $H_{\theta_0}(Z_j)$ from $\hat\varepsilon_{ni}$ and $\hat\varepsilon_{nj}$, respectively, and letting
$\xi_i = m_{\theta_0}(X_i) - H_{\theta_0}(Z_i) + \varepsilon_i$, we arrive at

$\hat\Gamma_n = 2 n^{-2} h^d \sum_{i \ne j} \Big( \int K_{hi}(z) K_{hj}(z) \xi_i \xi_j \, d\hat\psi(z) \Big)^2 + \omega_n.$

The first term converges in probability to $\Gamma$. The remainder $\omega_n = o_p(1)$ can
be proven by using the C-S inequality on the double sum, consistency of $\hat\theta_n$,
(2.2) and the following facts:

$h^d n^{-2} \sum_{i \ne j} \Big( \int K_{hi}(z) K_{hj}(z) |\xi_i|^k \, d\psi(z) \Big)^2 = O_p(1), \qquad k = 0, 1.$



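Therefore, under the local alternative hypothesis (5.4),

$n h^{d/2} \hat\Gamma_n^{-1/2} (M_n(\hat\theta_n) - \hat C_n) = n h^{d/2} \hat\Gamma_n^{-1/2} (T_{n11} - D_n) + \hat\Gamma_n^{-1/2} T_{n13} + o_p(1),$

which, together with the facts $n h^{d/2} (T_{n11} - D_n) \to_d N_1(0, \Gamma)$, $T_{n13} \to \int R^2 \, dG$
and $\hat\Gamma_n \to \Gamma$ in probability, implies the theorem. □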

6. Simulations. This section contains results of two simulation studies
corresponding to the following cases: case 1: $d = q = 1$ and $m_\theta$ linear; case 2:
$d = q = 2$ and $m_\theta$ nonlinear. In each case the Monte Carlo average values of
$\hat\theta_n$, MSE($\hat\theta_n$), empirical levels and powers of the MD test are reported. The
asymptotic level is taken to be 0.05 in all cases.

In the first case $\{Z_i\}_{i=1}^n$ are obtained as a random sample from the uniform
distribution on $[-1, 1]$ and $\{\varepsilon_i\}_{i=1}^n$ and $\{\eta_i\}_{i=1}^n$ are obtained as two independent
random samples from $N_1(0, (0.1)^2)$. Then $(X_i, Y_i)$ are generated using
the model $Y_i = \mu(X_i) + \varepsilon_i$, $X_i = Z_i + \eta_i$, $i = 1, 2, \ldots, n$.
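The data-generating step just described can be sketched in a few lines of Python. This is only an illustration, not the authors' code: the function name `simulate_case1`, the use of NumPy and the seed handling are assumptions introduced here.

```python
import numpy as np

def simulate_case1(n, mu, sigma=0.1, seed=None):
    """Draw (Z, X, Y) from Y = mu(X) + eps, X = Z + eta (Berkson errors).

    Hypothetical helper for illustration; sigma = 0.1 matches the
    N(0, (0.1)^2) errors described in the text.
    """
    rng = np.random.default_rng(seed)
    Z = rng.uniform(-1.0, 1.0, size=n)      # observed design values
    eta = rng.normal(0.0, sigma, size=n)    # Berkson measurement error
    eps = rng.normal(0.0, sigma, size=n)    # regression error
    X = Z + eta                             # true predictor (not observed in practice)
    Y = mu(X) + eps
    return Z, X, Y

# Example: data under the null model mu(x) = x
Z, X, Y = simulate_case1(200, mu=lambda x: x, seed=0)
```

Only `Z` and `Y` would be available to the analyst; `X` is returned here just so the simulated measurement error can be inspected.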

The kernel functions and the bandwidths used in the simulation are

$K(z) = K^*(z) = \tfrac{3}{4} (1 - z^2) I(|z| \le 1), \qquad h = \frac{a}{n^{1/3}}, \qquad w = b \Big( \frac{\log n}{n} \Big)^{1/5},$

with some choices for $a$ and $b$. The integrating measure $G$ is taken to be the
uniform measure on $[-1, 1]$.

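The parametric model is taken to be $m_\theta(x) = \theta x$, $x, \theta \in \mathbb{R}$, $\theta_0 = 1$. Then,
$H_\theta(z) = \theta z$. In this case various calculations simplify as follows. By taking
the derivative of $M_n(\theta)$ in $\theta$ and solving the equation $\partial M_n(\theta)/\partial\theta = 0$, we
obtain $\hat\theta_n = A_n / B_n$, where

$A_n = \int_{-1}^{1} \Big[ \sum_{i=1}^n K_{hi}(z) Y_i \Big] \Big[ \sum_{i=1}^n K_{hi}(z) Z_i \Big] \Big[ \sum_{i=1}^n K^*_{wi}(z) \Big]^{-2} dz,$

$B_n = \int_{-1}^{1} \Big[ \sum_{i=1}^n K_{hi}(z) Z_i \Big]^2 \Big[ \sum_{i=1}^n K^*_{wi}(z) \Big]^{-2} dz.$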



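Then, with $\hat\varepsilon_i := Y_i - \hat\theta_n Z_i$,

$M_n(\hat\theta_n) = \int_{-1}^{1} \Big( \sum_{i=1}^n K_{hi}(z) \hat\varepsilon_i \Big)^2 \Big[ \sum_{i=1}^n K^*_{wi}(z) \Big]^{-2} dz,$

$\hat C_n = \int_{-1}^{1} \Big[ \sum_{i=1}^n K_{hi}^2(z) \hat\varepsilon_i^2 \Big] \Big[ \sum_{i=1}^n K^*_{wi}(z) \Big]^{-2} dz.$

Table 1 reports the Monte Carlo mean and MSE($\hat\theta_n$) under $\mathcal{H}_0$ for the
sample sizes 50, 100, 200, 500, each repeated 1000 times. One can see there
appears to be little bias in $\hat\theta_n$ for all chosen sample sizes and, as expected,
the MSE decreases as the sample size increases.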
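The closed form $\hat\theta_n = A_n / B_n$ for the linear model can be illustrated numerically. The following Python sketch is an assumption-laden illustration rather than the authors' implementation: it assumes the Epanechnikov kernel with constant 3/4, bandwidths $h = a n^{-1/3}$ and $w = b (\log n / n)^{1/5}$, and approximates the integrals over $[-1, 1]$ by a Riemann sum on an equally spaced grid. The names `epan`, `md_estimate_linear` and `n_grid` are hypothetical.

```python
import numpy as np

def epan(u):
    """Epanechnikov kernel (3/4)(1 - u^2) on [-1, 1] (assumed kernel)."""
    return 0.75 * (1.0 - u**2) * (np.abs(u) <= 1.0)

def md_estimate_linear(Z, Y, a=0.5, b=0.5, n_grid=801):
    """Minimum distance estimate A_n / B_n for H_theta(z) = theta * z."""
    n = len(Z)
    h = a / n ** (1.0 / 3.0)              # bandwidth for the regression part
    w = b * (np.log(n) / n) ** 0.2        # bandwidth for the density estimate
    z = np.linspace(-1.0, 1.0, n_grid)    # integration grid on the support of G
    dz = z[1] - z[0]
    Kh = epan((z[:, None] - Z[None, :]) / h) / h   # K_hi(z), shape (grid, n)
    Kw = epan((z[:, None] - Z[None, :]) / w) / w   # K*_wi(z)
    SY, SZ = Kh @ Y, Kh @ Z               # sum_i K_hi(z) Y_i and sum_i K_hi(z) Z_i
    S0 = Kw.sum(axis=1)                   # sum_i K*_wi(z)
    ok = S0 > 0                           # skip grid points with no data within w
    A = np.sum(SY[ok] * SZ[ok] / S0[ok] ** 2) * dz
    B = np.sum(SZ[ok] ** 2 / S0[ok] ** 2) * dz
    return A / B

# Example under the null model Y = X + eps, X = Z + eta
rng = np.random.default_rng(0)
Z = rng.uniform(-1.0, 1.0, 200)
Y = (Z + rng.normal(0.0, 0.1, 200)) + rng.normal(0.0, 0.1, 200)
theta_hat = md_estimate_linear(Z, Y)
```

Because $\hat\theta_n$ is a ratio of two integrals against the same measure, any constant normalization of $G$ cancels, which is why the grid weight `dz` is the only factor carried along.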

To assess the level and power behavior of the $\mathcal{D}_n$-test, we chose the following
four models to simulate data from; in each of these cases $X_i = Z_i + \eta_i$:

Model 0: $Y_i = X_i + \varepsilon_i$,

Model 1: $Y_i = X_i + 0.3 X_i^2 + \varepsilon_i$,

Model 2: $Y_i = X_i + 1.4 \exp(-0.2 X_i^2) + \varepsilon_i$,

Model 3: $Y_i = X_i I(X_i > 0.2) + \varepsilon_i$.

To assess the effect of the choice of $(a, b)$ that appear in the bandwidths
on the level and power, we ran simulations for numerous choices of $(a, b)$,
ranging from 0.2 to 1. Table 2 reports these simulation results pertaining
to $\mathcal{D}_n$ for three choices of $(a, b)$. Simulation results for the other choices
were similar to those reported here. Data from Model 0 in this table are
used to study empirical sizes and data from Models 1 to 3 are used to
study empirical powers of the test. These entities are obtained by computing
$\#\{|\mathcal{D}_n| > 1.96\}/1000$, where $\mathcal{D}_n := n h^{1/2} \hat\Gamma_n^{-1/2} (M_n(\hat\theta_n) - \hat C_n)$.
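As an illustration of how one replication of this computation might look, the sketch below evaluates $\mathcal{D}_n$ for the linear case. It is not the authors' code. It assumes the Epanechnikov kernel, the bandwidth forms $h = a n^{-1/3}$ and $w = b(\log n / n)^{1/5}$, a Riemann-sum approximation of the integrals, and the variance estimator $\hat\Gamma_n = 2 h n^{-2} \sum_{i \ne j} (\int K_{hi} K_{hj} \hat\varepsilon_i \hat\varepsilon_j \, d\hat\psi)^2$ with $d\hat\psi = dG / \hat f_{Zw}^2$. All function and variable names are hypothetical.

```python
import numpy as np

def epan(u):
    """Epanechnikov kernel (3/4)(1 - u^2) on [-1, 1] (assumed kernel)."""
    return 0.75 * (1.0 - u**2) * (np.abs(u) <= 1.0)

def md_test_linear(Z, Y, a=0.5, b=0.5, n_grid=801):
    """Return (theta_hat, D_n) for the model H_theta(z) = theta * z, d = 1."""
    n = len(Z)
    h = a / n ** (1.0 / 3.0)
    w = b * (np.log(n) / n) ** 0.2
    z = np.linspace(-1.0, 1.0, n_grid)
    dz = z[1] - z[0]
    Kh = epan((z[:, None] - Z[None, :]) / h) / h     # K_hi(z), shape (grid, n)
    f_hat = (epan((z[:, None] - Z[None, :]) / w) / w).mean(axis=1)
    ok = f_hat > 0                                   # drop grid points with f_hat = 0
    Kh, f_hat = Kh[ok], f_hat[ok]
    wt = 0.5 * dz / f_hat**2        # d psi_hat = dG / f_hat^2, G uniform on [-1, 1]
    SY, SZ = Kh @ Y, Kh @ Z
    theta = np.sum(wt * SY * SZ) / np.sum(wt * SZ**2)    # A_n / B_n
    res = Y - theta * Z                                  # residuals
    M = np.sum(wt * (Kh @ res / n) ** 2)                 # M_n(theta_hat)
    C = np.sum(wt * ((Kh**2) @ res**2)) / n**2           # C_n hat
    P = (Kh * wt[:, None]).T @ Kh       # P[i, j] = int K_hi K_hj d psi_hat
    Q = P * np.outer(res, res)
    np.fill_diagonal(Q, 0.0)            # keep only the i != j terms
    Gamma = 2.0 * h * np.sum(Q**2) / n**2
    D = n * np.sqrt(h) * (M - C) / np.sqrt(Gamma)
    return theta, D

# One replication under Model 0 and one under Model 2
rng = np.random.default_rng(1)
n = 200
Z = rng.uniform(-1.0, 1.0, n)
X = Z + rng.normal(0.0, 0.1, n)
eps = rng.normal(0.0, 0.1, n)
theta0_hat, D0 = md_test_linear(Z, X + eps)
theta2_hat, D2 = md_test_linear(Z, X + 1.4 * np.exp(-0.2 * X**2) + eps)
reject0, reject2 = abs(D0) > 1.96, abs(D2) > 1.96
```

Repeating the replication 1000 times and averaging the rejection indicators would give the empirical level (Model 0) or power (Models 1 to 3) in the sense used for Table 2. Note that $M_n - \hat C_n$ equals the off-diagonal sum `np.sum(Q) / n**2`, so the statistic does not depend on the normalization chosen for $G$.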

From Table 2, one sees that the empirical level is sensitive to the choice of
$(a, b)$ for moderate sample sizes ($n \le 200$) but gets closer to the asymptotic
level of 0.05 as the sample size increases, and hence is stable over
the chosen values of $(a, b)$ for large sample sizes. On the other hand, the
empirical power appears to be far less sensitive to the values of $(a, b)$ for
sample sizes of 100 and more. Even though the theory of the present
paper is not applicable to Model 3, it was included here to see the effect of
the discontinuity in the regression function on the power of the minimum
distance test. In our simulation, this discontinuity has little
effect on the power of the test.

Table 1
Mean and MSE of $\hat\theta_n$

Sample size      50       100      200      500
Mean           1.0003   0.9987   1.0006   0.9998
MSE            0.0012   0.0006   0.0003   0.0001

Table 2
Levels and powers of the minimum distance test

                              Sample size
Model      a, b          50      100     200     500
Model 0    0.3, 0.2     0.007   0.026   0.028   0.048
           0.5, 0.5     0.014   0.022   0.040   0.051
           1.0, 1.0     0.021   0.020   0.031   0.043
Model 1    0.3, 0.2     0.754   0.987   1.000   1.000
           0.5, 0.5     0.945   1.000   1.000   1.000
           1.0, 1.0     1.000   1.000   1.000   1.000
Model 2    0.3, 0.2     0.857   0.996   1.000   1.000
           0.5, 0.5     0.999   1.000   1.000   1.000
           1.0, 1.0     1.000   1.000   1.000   1.000
Model 3    0.3, 0.2     0.874   0.993   1.000   1.000
           0.5, 0.5     1.000   1.000   1.000   1.000
           1.0, 1.0     1.000   1.000   1.000   1.000

Now consider case 2, where $d = 2$, $q = 2$ and $m_\theta(x) = \theta_1 x_1 + \exp(\theta_2 x_2)$,
$\theta = (\theta_1, \theta_2)' \in \mathbb{R}^2$, $x_1, x_2 \in \mathbb{R}$. Accordingly, here $H_\theta(z) = \theta_1 z_1 + \exp(\theta_2 z_2 + 0.005 \theta_2^2)$.
The true $\theta_0 = (1, 2)'$ was used in these simulations.

In all models below, $\{Z_i = (Z_{1i}, Z_{2i})'\}_{i=1}^n$ are obtained as a random sample
from the uniform distribution on $[-1, 1]^2$, $\{\varepsilon_i\}_{i=1}^n$ are obtained from
$N_1(0, (0.1)^2)$ and $\{\eta_i = (\eta_{1i}, \eta_{2i})'\}_{i=1}^n$ are obtained from the bivariate normal
distribution with mean vector 0 and the diagonal covariance matrix with
both diagonal entries equal to $(0.1)^2$. We simulated data from the following
four models, where $X_i = Z_i + \eta_i$:

Model 0: Yi = Xu + exp(2X2i) + Si, 

Model 1: Yi = Xu + exp(2X2i) + lAX^ + l + ei, 

Model 2: Yi = Xu + exp(2X2i) + lAXlxl + Si, 

Model 3: Yi = Xu + exp{2X2i) + 1.4(exp(-0.2XH) + exp(0.7X|)) + Si. 
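The form of $H_\theta$ used in case 2 follows from the normal moment generating function: for $\eta \sim N(0, \sigma^2)$, $E \exp(\theta_2 (z_2 + \eta)) = \exp(\theta_2 z_2 + \sigma^2 \theta_2^2 / 2)$, and $\sigma = 0.1$ gives the term $0.005 \theta_2^2$. The short Python sketch below checks this by Monte Carlo at one point. It is an illustration only; the helper `H_theta`, the evaluation point `z` and the number of draws are choices made here, not taken from the paper.

```python
import numpy as np

def H_theta(z, theta, sigma=0.1):
    """E[m_theta(z + eta)] for m_theta(x) = theta1*x1 + exp(theta2*x2),
    with eta ~ N(0, sigma^2 I_2); sigma^2/2 = 0.005 when sigma = 0.1."""
    t1, t2 = theta
    return t1 * z[..., 0] + np.exp(t2 * z[..., 1] + 0.5 * (sigma * t2) ** 2)

rng = np.random.default_rng(0)
theta0 = (1.0, 2.0)                      # true parameter used in case 2
z = np.array([0.3, -0.4])                # arbitrary evaluation point (assumption)
eta = rng.normal(0.0, 0.1, size=(200_000, 2))
x = z + eta                              # Berkson model: X = Z + eta
mc = np.mean(theta0[0] * x[:, 0] + np.exp(theta0[1] * x[:, 1]))
exact = H_theta(z, theta0)
```

The Monte Carlo average `mc` and the closed form `exact` should agree up to simulation error, which is what makes $H_\theta$ available in closed form for the minimization of $M_n(\theta)$ in this case.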

Bandwidths and kernel function used in the simulation were taken to be
$h = n^{-1/4.5}$, $w = n^{-1/6} (\log n)^{1/6}$ and

$K(z) = K^*(z) = \tfrac{9}{16} (1 - z_1^2)(1 - z_2^2) I(|z_1| \le 1, |z_2| \le 1).$

The sample sizes chosen are 50, 100, 200 and 300, each repeated 1000
times. Table 3 lists means and MSE of $\hat\theta_n = (\hat\theta_{n1}, \hat\theta_{n2})'$ obtained by minimizing
$M_n(\theta)$ and employing the Newton-Raphson algorithm. As in case 1, one
sees little bias in the estimator for all chosen sample sizes.

Table 4 gives the empirical sizes and powers of the $\mathcal{D}_n$-test for testing
Model 0 against Models 1-3. From this table one sees that this test is
conservative when sample sizes are small, while empirical levels increase with
the sample sizes and indeed preserve the nominal size 0.05. It also shows that
the MD test performs well for sample sizes 200 and larger at all alternatives.

Table 3
Mean and MSE of $\hat\theta_n$

Sample size                     50       100      200      300
Mean of $\hat\theta_{n1}$     0.9978   0.9973   0.9974   0.9988
MSE of $\hat\theta_{n1}$      0.0190   0.0095   0.0053   0.0034
Mean of $\hat\theta_{n2}$     1.9962   1.9965   2.0013   2.0004
MSE of $\hat\theta_{n2}$      0.0063   0.0028   0.0014   0.0010





Table 4
Levels and powers of the minimum distance test in case 2

Sample size      50      100     200     300
Model 0         0.003   0.019   0.049   0.052
Model 1         0.158   0.843   0.979   0.996
Model 2         0.165   0.840   0.976   0.992
Model 3         0.044   0.608   0.954   0.997


Acknowledgment. The authors would like to thank the two referees and
Jianqing Fan for constructive comments.

REFERENCES 

[1] An, H. Z. and Cheng, B. (1991). A Kolmogorov-Smirnov type statistic with
application to test for nonlinearity in time series. Internat. Statist. Rev. 59 287-307.

[2] Beran, R. J. (1977). Minimum Hellinger distance estimates for parametric models.
Ann. Statist. 5 445-463. MR0448700

[3] Berkson, J. (1950). Are there two regressions? J. Amer. Statist. Assoc. 45 164-180.

[4] Bosq, D. (1998). Nonparametric Statistics for Stochastic Processes: Estimation
and Prediction, 2nd ed. Lecture Notes in Statist. 110. Springer, New York.
MR1640691

[5] Carroll, R. J., Ruppert, D. and Stefanski, L. A. (1995). Measurement Error in
Nonlinear Models. Chapman and Hall/CRC, Boca Raton, FL. MR1630517

[6] Cheng, C. and Van Ness, J. W. (1999). Statistical Regression with Measurement
Error. Arnold, London; co-published by Oxford Univ. Press, New York.
MR1719513

[7] Fan, J. (1991a). On the optimal rates of convergence for nonparametric deconvolution
problems. Ann. Statist. 19 1257-1272. MR1126324

[8] Fan, J. (1991b). Asymptotic normality for deconvolution kernel density estimators.
Sankhyā Ser. A 53 97-110. MR1177770

[9] Fan, J. and Truong, K. T. (1993). Nonparametric regression with errors in
variables. Ann. Statist. 21 1900-1925. MR1245773

[10] Fuller, W. A. (1987). Measurement Error Models. Wiley, New York. MR0898653

[11] Eubank, R. L. and Spiegelman, C. H. (1990). Testing the goodness of fit of a
linear model via nonparametric regression techniques. J. Amer. Statist. Assoc.
85 387-392. MR1141739

[12] Hart, J. D. (1997). Nonparametric Smoothing and Lack-of-Fit Tests. Springer, New
York. MR1461272

[13] Huwang, L. and Huang, Y. H. S. (2000). On errors-in-variables in polynomial
regression: Berkson case. Statist. Sinica 10 923-936. MR1787786

[14] Koul, H. L. and Ni, P. (2004). Minimum distance regression model checking. J.
Statist. Plann. Inference 119 109-141. MR2018453

[15] Mack, Y. P. and Silverman, B. W. (1982). Weak and strong uniform consistency
of kernel regression estimates. Z. Wahrsch. Verw. Gebiete 61 405-415. MR0679685

[16] Rudemo, M., Ruppert, D. and Streibig, J. (1989). Random effects models in
nonlinear regression with applications to bioassay. Biometrics 45 349-362.
MR1010506

[17] Stute, W. (1997). Nonparametric model checks for regression. Ann. Statist. 25
613-641. MR1439316

[18] Stute, W., Thies, S. and Zhu, L. X. (1998). Model checks for regression: An
innovation process approach. Ann. Statist. 26 1916-1934. MR1673284

[19] Wang, L. (2003). Estimation of nonlinear Berkson-type measurement errors models.
Statist. Sinica 13 1201-1210. MR2026069

[20] Wang, L. (2004). Estimation of nonlinear models with Berkson measurement errors.
Ann. Statist. 32 2559-2579. MR2153995



Department of Statistics and Probability
Michigan State University
East Lansing, Michigan 48824-1027
USA
E-mail: koul@stt.msu.edu

Department of Statistics
Kansas State University
Manhattan, Kansas 66506-0802
USA
E-mail: weixing@ksu.edu



